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I. REAL PARTY IN INTEREST 

The real party in interest is the assignee, Research Development Foundation. The subject 
matter of this application is licensed to Merck & Co., Inc. 

II. RELATED APPEALS AND INTERFERENCES 

There are no interferences or appeals for related cases. 

III. STATUS OF THE CLAIMS 

Claims 1-7, 9-11, 13, 41, 46 and 47 are pending and under examination. Claims 8, 12, 
14-40, 42-45 and 48-84 are withdrawn. No claims are canceled. Claims 1-7, 9-11, 13, 41, 46 
and 47 are the subject of the present appeal. 

IV. STATUS OF AMENDMENTS 
There are no pending amendments. 

V. SUMMARY OF THE CLAIMED SUBJECT MATTER 

The invention, as set forth in independent claim 1, concerns a polynucleotide construct 
comprising a region encoding a siRNA operably linked to an externally controllable RNA 
polymerase III promoter, wherein expression of the siRNA is regulated by a polypeptide 
regulator having both a DNA binding domain and a repressor domain. See specification at page 
5, lines 14-21 and page 8, lines 8-22. 

The invention as set forth in independent claim 41 concerns a mammalian cell 
comprising a polynucleotide construct comprising a region encoding a siRNA operably linked to 
an externally controllable RNA polymerase III promoter, wherein expression of the siRNA is 
regulated by a polypeptide regulator having both a DNA binding domain and a repressor domain, 
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said polypeptide regulator being encoded by said cell. Specific examples including 
undifferentiated cells (claim 46) and oocytes (claim 47). See specification at pages 1 1 and 12. 

VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

There are four rejections that are the subject of the present appeal: 

1) A rejection of claims 41, 46 and 47 as lacking written description under 35 U.S.C. 
§112, first paragraph; 

2) A rejection of claims 1-7, 9-11, 13, 41 and 46-47 as lacking written description 
under 35 U.S.C. §112, first paragraph; 

3) A rejection of claims 41, 46 and 47 as unpatentable under 35 U.S.C. §101; 

4) A rejection of claims 1-7, 9-11, 13, 41 and 46-47 as unpatentable under 35 U.S.C. 
§103 over Yao et al ("Yao"; Exhibit 1), Verma et al ("Verma"; Exhibit 2), Elbasher et al 
("ElbasherA"; Exhibit 3), Elbasher et a/./Nature ("ElbasherB"; Exhibit 4) and Deuschle et al 
("Deuschle"; Exhibit 5); and 

5) A rejection of claims 1-7, 9-1 1, 13, 41 and 46-47 as unpatentable under 35 U.S.C. 
§103 over Giordano et al. ("Giordano"; Exhibit 6), ElbasherB and Deuschle and Verma. 

VII. ARGUMENT 

A. Rejection of claims 41, 46 and 47 as lacking written description under 35 
U.S.C. §112, first paragraph 

The Final Action first rejected claims 41, 46 and 47 as lacking written description under 
35 U.S.C. §112 taking the position that the phrase "wherein expression of the siRNA is regulated 
by a polypeptide regulator having both a DNA binding domain and a repressor domain, said 
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polypeptide regulator begin encoded by said cell" constitutes new matter. In particular, the 
Action posits that the originally filed specification fails to disclose the polypeptide regulator 
being encoded by a cell. In response, we would note that there are many, many places in the 
specification where this is discussed. Indeed, it is difficult to visualize carrying out this aspect of 
the invention without the polypeptide regulator being encoded by the cell. 

First, we would direct the Board's attention to excerpts which teach the general concept 
of expressing poplypeptide regulator, exemplified by tTR-KRAB, by cells to effect controlled 
expression of siRNA by the cell: 

Expression of tTR-KRAB or other regulatable system may be conditional, 
inducible, tissue-specific, constitutive, or locally utilized (such as locally applied). 
A founder cell or cell line can be used for introduction of a nucleic acid sequence 
containing a sequence of interest under the control of regulatory element for the 
transcriptional fusion protein (effector polynucleotide), such as tTR-KRAB. 

Specification, page 12, lines 19-23. 

Next, we would direct the Board's attention to the paragraph bridging pages 20 and 21, 
wherein the specification describes the preparation of transgenic animals, wherein cells of the 
transgenic animals are transfected with the coding sequence for the polypeptide regulator: 

Furthermore, such animals may also be regulated in a conditional and 
tissue-specific manner comprising: a) obtaining a first transgenic animal having a 
Cre recombinase-encoding polynucleotide under the control of a tissue-specific 
promoter; b) obtaining a second transgenic animal having i) an siRNA-encoding 
nucleic acid segment under the control of a regulatable promoter region, 
wherein the siRNA corresponds to the target gene; and, ii) the polynucleotide 
encoding the regulatable polypeptide regulator, and, c) mating opposite sexes of 
the first and second animals. In some embodiments, the second transgenic animal 
is obtained by: d) transfecting an undifferentiated mammalian cell with i) a 
regulatable siRNA-expression construct comprising an siRNA encoding nucleic 
acid segment and a regulatable promoter region, wherein an excisable fragment 
is located between the segment and the regulatable promoter region; and ii) a 
polynucleotide encoding a regulatable polypeptide regulator of the regulatable 
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siRNA-expression construct; e) fertilizing the cell if the cell has a haploid 
genome; and, f) transplanting the embryo into a female animal, wherein the 
female animal produces the second transgenic animal. In some cases, an 
undifferentiated cell is an unfertilized oocyte, a fertilized oocyte, an embryonic 
stem cell, a cell within a morula or blastocyst. Moreover, methods can also 
involve culturing the cell prior to transfection and/or transplantation. Further, such 
methods can involve conventional matings with transgenic animals of the 
invention. 

Specification, para. Bridging pages 20-21. 

The foregoing concept is also embodied in the originally filed claims: 

42. The mammalian cell of claim 41 , further defined as comprising: 

(a) a first polynucleotide sequence comprising a promoter operably 
linked to at least one nucleic acid segment encoding an siRNA; 

(b) a second polynucleotide sequence encoding a conditional repressor 
fusion protein that comprises a DNA binding domain and a 
transcription repression domain; and 

(c) a third polynucleotide sequence bindable by the binding domain of 
the fusion protein of (b) and positioned such that the transcription 
repression domain acts to repress transcription of the nucleic acid 
segment of (a). 

43. The mammalian cell of claim 41, wherein the conditional repressor fusion 
protein is drug inducible. 

The foregoing claims evidence the fact that the application as originally filed 
contemplated that the target cells would include and express both the polynucleotide that 
encoded the siRNA, but also a separate polynucleotide that encoded the fusion protein regulatory 
element (i.e., the "polypeptide regulator"). 
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B. Rejection of claims 1-7, 9-11, 13, 41 and 46-47 as lacking written description 
under 35 U.S.C. §112, first paragraph 

The Action next rejects claims 1-7, 9-11, 13, 41 and 46-47, taking the position that the 
concept of the polypeptide regulator having both a DNA binding domain and a repressor domain 
is insufficiently described in the specification. 

In response, we would note that the specification has a detailed description of various 
examples of DNA binding domains and repressor domains that can be used in the practice of the 
invention, as can be seem: 

In a specific embodiment, the KRAB domain does not come from Kox-1 
but from another zinc-finger protein-containing KRAB domain. Thus, in one 
embodiment, the KRAB repression domain from the human KOX-1 protein is 
used as a transcriptional repressor (Thiesen et aL, 1990; Margolin et aL, 1994; 
Pengue et aL, 1994; Witzgall et aL, 1994). In another embodiment, KAP-1, a 
KRAB co-repressor, is used with KRAB (Friedman et aL, 1996), either as part of 
the same fusion protein or provided separately. Alternatively, KAP-1 can be used 
alone with a zinc finger protein. 



In addition to the exemplary tTR-KRAB fusion protein as an external 
agent-inducible repressor fusion protein, other fusion proteins comprising 
different DNA binding domains and transcriptional repressor domains may be 
utilized. In embodiments of the present invention, a DNA binding domain is 
utilized as part of a drug-inducible regulatory fusion protein, and the DNA- 
binding domain may include sequences such as the DNA-binding domains of the 
tetracycline repressor (tTR), or those of GAL4 or LexA, for example. 

In embodiments of the present invention wherein a repressor domain is 
utilized as part of an external agent-inducible regulatory fusion protein, the 
repressor domain is the Kruppel-associated box domain (KRAB). However, other 
repressor domains include ERD or SID transcriptional repressor domains, for 
example. Other preferred transcription factors and transcription factor domains 
that act as transcriptional repressors include, for example, MAD (see, e.g., 
Sommer et aL, 1998; Gupta et aL, 1998; Queva et aL, 1998; Larsson et aL, 1997; 
Laherty et aL, 1997; and Cultraro et aL, 1997); FKHR (forkhead in 
rhapdosarcoma gene; Ginsberg et aL, 1998; Epstein et aL, 1998); EGR-1 (early 
growth response gene product- 1; Yan et aL, 1998; and Liu et aL, 1998); the ets2 
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repressor factor repressor domain (ERD; Sgouras et al, 1995); and the MAD 
smSIN3 interaction domain (SID; Ayer et al, 1996). 

Specification, pages 8, 10-1 1. 

From the foregoing, it is evident that the specification sets forth both a large number of 
known DNA binding domains and separate repressor domains that can be employed in the 
practice of the invention in order to provide for expression of the polypeptide regulator. 

The Action also takes that position that the specification does not adequately describe the 
"polypeptide regulator." We suspect that the Examiner may be a bit confused as to what is 
intended by the polypeptide regulator. As noted in the foregoing section, the polypeptide 
regulator is simply the regulatory fusion protein that includes both the DNA binding domain and 
the repressor domain. An example is the "tTR-KRAB fusion protein" but as can be seen from 
the foregoing excerpts, the specification describes many other possible examples. 

The Action further complains that the specification fails to adequately describe examples 
of "externally controllable promoters." Again, we strenuously disagree in that "externally 
controllable promoters" are exceedingly well known in the art and described in the Summary of 
specification: 

In a particular embodiment, the invention regards a polynucleotide 
construct comprising a region encoding a siRNA operably linked to an externally 
controllable promoter, and the construct may be further defined as a vector. A 
non-limiting example of a vector includes a lentiviral vector, a retroviral vector, 
an MLV vector, an AAV vector, a plasmid vector or an adenoviral vector. The 
externally controllable promoter may be a repressible promoter whereby 
expression of the encoded siRNA can be downregulated by means of an 
externally applied agent. The expression of the encoded siRNA can be 
downregulated by means of an externally applied drug. 

In specific embodiments, the repressible promoter is regulated by a Tet 
repressor and/or is defined as further comprising at least one tetO sequence. The 
repressible promoter may be regulated by the lad repressor, or the repressible 
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promoter may be from the gene of ANB1, HEM 13, ERG 11, OLE 1, GAL1, 
GAL 10, ADH2, or TET R 



In particular embodiments, the externally controllable promoter is an 
inducible promoter whereby expression of the encoded siRNA can be upregulated 
by means of an externally applied agent. Inducible promoters may be inducible 
by Cu 2+ , Zn 2+ , tetracycline, tetracycline analog, ecdysone, glucocorticoid, 
tamoxifen, or an inducer of the lac operon. The promoter may be inducible by 
ecdysone, glucocorticoid, or tamoxifen. In specific embodiments, the inducible 
promoter is a phage inducible promoter, nutrient inducible promoter, temperature 
inducible promoter, radiation inducible promoter, metal inducible promoter, 
hormone inducible promoter, steroid inducible promoter, or combination thereof. 
Examples of radiation inducible promoters include fos promoter, jun promoter, or 
erg promoter. 

Systems for the regulation of gene expression that may be used within the 
contemplated scope of the invention include regulatory systems utilizing 
compounds such as progesterone, estrogen, and/or ecdysone. 

Specification, pages 5-6. 

Furthermore, we would direct the Board to the 14-page section of the specification 
entitled "Controlled expression of siRNA constructs" at pages 29-42 which includes a very 
detailed, exemplary listing and description of possible control promoters that could be employed 
in the practice of the invention. 

The foregoing is consistent with Federal Circuit caselaw interpreting the law of written 

description in the context of biotech inventions such as the present one. Instructive in this regard 

is the Federal Circuit's recent decision in Capon v. Eshhar v. Dudas, 418 F.3d 1349, 76 USPQ2d 

1078 (Fed. Cir. 2005). As the Capon court points out, there is no requirement under written 

description that a specification contain a detailed description of elements where those elements 

are well known to those in the field: 

The Board stated that "controlling precedent" required inclusion in the 
specification of the complete nucleotide sequence of "at least one" chimeric gene. 
Bd. op. at 4. The Board also objected that the claims were broader than the 
specific examples. Eshhar and Capon each responds by pointing to the scientific 
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completeness and depth of their descriptive texts, as well as to their illustrative 
examples. The Board did not relate any of the claims, broad or narrow, to the 
examples, but invalidated all of the claims without analysis of their scope and the 
relation of claim scope to the details of the specifications. 

Eshhar and Capon both argue that they have set forth an invention whose 
scope is fully and fairly described, for the nucleotide sequences of the DNA in 
chimeric combination is readily understood to contain the nucleotide sequences of 
the DNA components. Eshhar points to the general and specific description in his 
specification of known immune-related DNA segments, including the examples of 
their linking. Capon points similarly to his description of selecting DNA segments 
that are known to express immune-related proteins, and stresses the existing 
knowledge of these segments and their nucleotide sequences, as well as the 
known procedures for selecting and combining DNA segments, as cited in the 
specification. 

Both parties argue that the Board misconstrued precedent, and that 
precedent does not establish a per se rule requiring nucleotide-by-nucleotide re- 
analysis when the structure of the component DNA segments is already known, or 
readily determined by known procedures. The "written description" requirement 
implements the principle that a patent must describe the technology that is sought 
to be patented; the requirement serves both to satisfy the inventor's obligation to 
disclose the technologic knowledge upon which the patent is based, and to 
demonstrate that the patentee was in possession of the invention that is claimed. 
See Enzo Biochem, 296 F.3d at 1330 (the written description requirement "is the 
quid pro quo of the patent system; the public must receive meaningful disclosure 
in exchange for being excluded from practicing the invention for a limited period 
of time"); Reiffin v. Microsoft Corp,, 214 F.3d 1342, 1345-46 (Fed. Cir. 2000) 
(the purpose of the written description requirement "is to ensure that the scope of 
the right to exclude . . . does not overreach the scope of the inventor's contribution 
to the field of art as described in the patent specification"); In re Barker, 559 F.2d 
588, 592 n.4 (CCPA 1977) (the goal of the written description requirement is "to 
clearly convey the information that an applicant has invented the subject matter 
which is claimed"). The written description requirement thus satisfies the policy 
premises of the law, whereby the inventor's technical/scientific advance is added 
to the body of knowledge, as consideration for the grant of patent exclusivity. 

The descriptive text needed to meet these requirements varies with the 
nature and scope of the invention at issue, and with the scientific and technologic 
knowledge already in existence. The law must be applied to each invention that 
enters the patent process, for each patented advance is novel in relation to the 
state of the science. Since the law is applied to each invention in view of the state 
of relevant knowledge, its application will vary with differences in the state of 
knowledge in the field and differences in the predictability of the science. 

For the chimeric genes of the Capon and Eshhar inventions, the law must 
take cognizance of the scientific facts. The Board erred in refusing to consider the 
state of the scientific knowledge, as explained by both parties, and in declining to 
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consider the separate scope of each of the claims. None of the cases to which the 
Board attributes the requirement of total DNA re-analysis, i.e., Regents v. Lilly 9 
Fiers v. Revel, Amgen, or Enzo Biochem, require a re-description of what was 
already known. In Lilly, 119 F.3d at 1567, the cDNA for human insulin had never 
been characterized. Similarly in Fiers, 984 F.2d at 1 171, much of the DNA sought 
to be claimed was of unknown structure, whereby this court viewed the breadth of 
the claims as embracing a "wish" or research "plan." In Amgen, 927 F.2d at 1206, 
the court explained that a novel gene was not adequately characterized by its 
biological function alone because such a description would represent a mere 
"wish to know the identity" of the novel material. In Enzo Biochem, 296 F.3d at 
1326, this court reaffirmed that deposit of a physical sample may replace words 
when description is beyond present scientific capability. In Amgen Inc. v. Hoechst 
Marion Roussel, Inc., 314 F.3d 1313, 1332 (Fed. Cir. 2003) the court explained 
further that the written description requirement may be satisfied "if in the 
knowledge of the art the disclosed function is sufficiently correlated to a 
particular, known structure." These evolving principles were applied in Noelle v. 
Lederman, 355 F.3d 1343, 1349 (Fed. Cir. 2004), where the court affirmed that 
the human antibody there at issue was not adequately described by the structure 
and function of the mouse antigen; and in University of Rochester v. G.D. Searle 
& Co., 358 F.3d 916, 925-26 (Fed. Cir. 2004), where the court affirmed that the 
description of the COX-2 enzyme did not serve to describe unknown compounds 
capable of selectively inhibiting the enzyme. 

The "written description" requirement must be applied in the context of 
the particular invention and the state of the knowledge. The Board's rule that the 
nucleotide sequences of the chimeric genes must be fully presented, although the 
nucleotide sequences of the component DNA are known, is an inappropriate 
generalization. When the prior art includes the nucleotide information, precedent 
does not set a per se rule that the information must be determined afresh. Both 
parties state that a person experienced in the field of this invention would know 
that these known DNA segments would retain their DNA sequences when linked 
by known methods. Both parties explain that their invention is not in discovering 
which DNA segments are related to the immune response, for that is in the prior 
art, but in the novel combination of the DNA segments to achieve a novel result. 

The "written description" requirement states that the patentee must 
describe the invention; it does not state that every invention must be described in 
the same way. As each field evolves, the balance also evolves between what is 
known and what is added by each inventive contribution. Both Eshhar and Capon 
explain that this invention does not concern the discovery of gene function or 
structure, as in Lilly. The chimeric genes here at issue are prepared from known 
DNA sequences of known function. The Board's requirement that these sequences 
must be analyzed and reported in the specification does not add descriptive 
substance. The Board erred in holding that the specifications do not meet the 
written description requirement because they do not reiterate the structure or 
formula or chemical name for the nucleotide sequences of the claimed chimeric 
genes. 
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Applicants highlight a quote from the preceding passage: "None of the cases to which the Board 
attributes the requirement of total DNA re-analysis, i.e., Regents v. Lilly 9 Fiers v. Revel, Amgen, 
or Enzo Biochem, require a re-description of what was already known." While Capon dealt with 
DNA sequences per se, we submit that the reasoning is fully consistent with the present case, 
which concerns genetic control elements. 

The Capon case has very recently been followed by the Federal Circuit, in Falkner v. 
Inglis, App. No. 05-1324, decided May 26, 2006 (Fed. Cir. 2006) (copy enclosed). In a section 
of the opinion entitled "Recitation of Known Structure Is Not Required" the Falkner court, 
following the Capon decision, stated: 

Indeed, a requirement that patentees recite known DNA structures, if one existed, 
would serve no goal of the written description requirement. It would neither 
enforce the quid pro quo between the patentee and the public by forcing the 
disclosure of new information, nor would it be necessary to demonstrate to a 
person of ordinary skill in the art that the patentee was in possession of the 
claimed invention. . . . Accordingly, we hold that where, as in this case, accessible 
literature sources clearly provided, as of the relevant date, genes and their 
nucleotide sequences (here "essential genes"), satisfaction of the written 
description requirement does not require either the recitation or incorporation by 
reference (where permitted) of such genes and sequences. 

Id. at page 17-18. While Falkner dealt with nucleotide and gene sequences, the reasoning is 

certainly applicable here as well. 

Thus, it is submitted that there is clearly no prima facie basis for the subject written 

description rejection. 

C. Rejection of claims 41, 46 and 47 as unpatentable under 35 U.S.C. §101. 

The Action next rejects claims 41, 46 and 47 as directed to non-statutory subject matter in 
that the claims are said to read on transgenic humans. 
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In response, Applicants would note that while the claim could potentially cover some 
future transgenic human, the claims are not "directed to" a transgenic human and thus is not 
"directed to" non-statutory subject matter. 

Following the Examiner's logic, a claim directed to a transgenic DNA sequence could 
also be read to cover some hypothetical transgenic human, and yet such claims are routinely 
issued as are claims to transgenic mammalian cells. 

Further with respect to claims 46 (undifferentiated cells) and 47 (oocytes), these claims 
cannot be interpreted to read on a whole human. 

D. Rejection of claims 1-7, 9-11, 13, 41 and 46-47 as unpatentable under 35 
U.S.C. §103 over Yao, Verma, ElbasherA, ElbasherB and Deuschle. 

Next, the Action rejects all of the pending and examined claims as unpatentable under 35 

U.S.C. §103 over Yao (Exhibit 1), Verma (Exhibit 2), ElbasherA (Exhibit 3), ElbasherB (Exhibit 

4) and Deuschle (Exhibit 5). 

Appellants note that Verma teaches lentiviruses in general, and their use in expressing 
heterologous nucleic acids, but observe that Verma is admittedly silent with respect to the 
expression of siRNA expression constructs operably linked to an externally controllable RNA 
polymerase III promoter, wherein expression of the siRNA is regulated by a polypeptide 
regulator having both a DNA binding domain and a repressor domain. 

The Yao reference teaches nucleic acid constructs, including retroviruses, for expressing 
heterologous sequences under the control of a Tet repressor, but is also silent with respect to the 
expression of siRNA expression constructs operably linked to an externally controllable RNA 
polymerase III promoter, wherein expression of the siRNA is regulated by a polypeptide 
regulator having both a DNA binding domain and a repressor domain. 
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The two Elbashir references relate generally to siRNA expression constructs, but do not 
appear to in any way teach or suggest their being operably linked to an externally controllable 
RNA polymerase III promoter, wherein expression of the siRNA is regulated by a polypeptide 
regulator having both a DNA binding domain and a repressor domain. Indeed, we have been 
unable to identify any teaching or suggestion to control siRNA expression through the use of any 
externally controllable promoter, much less one that has both a DNA binding domain and a 
repressor domain. 

Lastly, the Deuschle reference discloses an externally controllable promoter having a 
DNA binding domain and a repressor domain (TetR-KRAB), but we have been unable to find 
any teaching here relevant to the use of such a promoter in the context of an siRNA under the 
control of a pol III promoter as specified in the claims. For example, the TetR-KRAB constructs 
is employed here only in the context of the CMV promoter, which is a pol II not a pol III 
promoter. Further, the only heterologous gene that Appellant's have identified in Deuschle that 
is placed under the control of TetR-KRAB is the firefly luciferase gene. 

Thus, even if the references are properly combinable (which, as discussed below, we 
dispute) they nevertheless fail to teach or suggest the invention as claimed in claim 1, in that the 
combination fails to teach a polynucleotide construct comprising a region encoding a siRNA 
operably linked to an externally controllable RNA polymerase III promoter, wherein expression 
of the siRNA is regulated by a polypeptide regulator having both a DNA binding domain and a 
repressor domain as required by independent claims 1 and 41. 

However, we further submit that the Examiner has failed to demonstrate that the 
references are properly combinable. It appears that the only references relied upon that concern 
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siRNA, the Elbasher references, fail to teach the concept of controlled expression of siRNA. 

Indeed, we have been unable to identify any such teaching in any of the references relied upon 

by the Examiner. Accordingly, we fail to understand the Examiner's prima facie basis for 

combining the Elbasher references with the other references as proposed. The same can be said 

for the other references. Deuschle teaches a CMV promoter that is regulated by TetR-KRAB, 

but the Examiner has failed to explain how this reference is combinable with references that 

concern other required aspects of the claims, such as siRNA expression. Although not entirely 

clear from the record, the Examiner may be attempting to rely on the Verma's disclosure of 

antisense and ribozyme expression (col. 7). The same can be said for Yao's teaching of 

controlled expression of antisense molecules (page 20 "D"). However, antisense and ribozymes 

are dramatically different from siRNAs, which is seen most notably in the fact that the latter is 

expressed from pol III promoters, not pol II promoters, and in the fact that their underlying 

mechanisms of action are quite distinct. 

Here, the Examiner has failed to enunciate any motivation to combine the references in 

the manner suggested, and has failed, for example, to make a showing that it is within the 

"common knowledge and common sense" to expand the cited references to include the concept 

of using a pol III promoter in this construct. Dystar Textilfarben Gmbh v. C H. Patrick Co., 

2006 U.S. App. LEXIS 24642 (Fed. Cir. 2006). The Court of Appeals for the Federal Circuit 

stated this concept succinctly in In re Kotzab: 

. . . Even when obviousness is based on a single prior art reference, there must be a 
showing of a suggestion or motivation to modify the teachings of that reference. 
See B. F. Goodrich Co. v. Aircraft Braking Sys. Corp., 72 F.3d 1577, 1582, 37 
U.S.P.Q. 2d 1314, 1318 (Fed. Cir. 1996). 

The motivation, suggestion or teaching may come explicitly from 
statements in the prior art, the knowledge of one of ordinary skill in the art, or, in 
some cases the nature of the problem to be solved. See Dembiczak, 175 F.3d at 
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999, 50 U.S.P.O. 2D (BNA) at 1617. In addition, the teaching, motivation or 
suggestion may be implicit from the prior art as a whole, rather than expressly 
stated in the references. See WMS Gaming, Inc. v. International Game Tech., 184 
F.3d 1339, 1355, 51 U.S.P.O. 2D (BNA) 1385, 1397 (Fed. Cir. 1999). The test 
for an implicit showing is what the combined teachings, knowledge of one of 
ordinary skill in the art, and the nature of the problem to be solved as a whole 
would have suggested to those of ordinary skill in the art. See In re Keller, 642 
F.2d 413, 425, 208 U.S.P.O. (BNA) 871, 881 fCCPA 1981) (and cases cited 
therein). Whether the Board relies on an express or an implicit showing, it must 
provide particular findings related thereto. See Dembiczak, 175 F.3d at 999, 50 
U.S.P.O. 2D (BNA) at 1617. Broad conclusory statements standing alone are not 
"evidence." Id. 

In re Kotzab, 217 F.3d. 1365, 1370, 55 U.S.P.Q. 2d 1313, 1317 (emphasis in original). 

Thus, to make out a proper prima facie obviousness rejection, the Examiner must 
adequately explain the motivation to combine separate teachings directed to 1) externally 
regulatable siRNA expression, 2) under the control of an RNA polymerase III promoter, 3) 
where the siRNA expression is regulated by a polypeptide regulator having both a DNA binding 
domain and a repressor domain (such as TetR/KRAB). See MPEP §2143.01 The mere fact that 
references "can be combined or modified" or that the claimed invention is "within the 
capabilities of one of ordinary skill" is clearly insufficient. Id. at sections III and IV. 
Furthermore, the Examiner must demonstrate on the record that one of ordinary skill had a 
reasonable expectation that such a combination would be successful. MPEP §2143.02 
("Reasonable expectation of success required"). Neither of the foregoing has been set forth by 
the Examiner. 

Claims 46 and 47 Are Separately Patentable 

The subject matter of claims 46 and 47 are even further removed from the prior art. 

Claim 46 is directed to an undifferentiated mammalian cell that comprises a 
polynucleotide construct comprising a region encoding a siRNA operably linked to an externally 
controllable RNA polymerase III promoter, wherein expression of the siRNA is regulated by a 
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polypeptide regulator having both a DNA binding domain and a repressor domain, said 
polypeptide regulator being encoded by said cell. The Examiner apparently relies on Yao for 
such a suggestion. Action at 8 (citing page 15 of Yao). However, the section of Yao relied upon 
by the Examiner is very clearly only concerned with a "Method for Recombinantly Producing 
Protein" (emphasis ours). Thus, we see no basis in Yao (or any of the other references) for any 
suggestion to employ externally controlled siRNA in the context of undifferentiated cells such as 
stem cells. 

Claim 47 is directed to oocytes or fertilized oocytes that comprise a externally 
controllable siRNA expression construct and are even still further removed. Applicant's have 
been unable to identify what teaching, if any, is being relied upon with respect to the subject 
matter of claim 47. 

C A rejection of claims 1-7, 9-11, 13, 41 and 46-47 as unpatentable under 35 
U.S.C. §103 over Giordano et al. ("Giordano"; Exhibit 6), ElbasherB and 
Deuschle and Verma. 

Lastly, the Action rejects claims 1-7, 9-11, 13, 41 and 46-47 over many of the same 

references, but with the addition of the Giordano reference. The Action indicates that Giordano 

is cited as teaching inducible and repressible transcription systems that can be used to control the 

timing of the expression of dsRNA from, for example, retroviral vectors, and that an example of 

such a construct is one incorporating the Tet promoter. Action at pages 12-13. 

With respect to all of the rejections except Giordano, we incorporate the arguments set 
forth in the preceding section. With respect to Giordano, we have reviewed this reference and 
have been unable to find any teaching or suggestion of an siRNA operably linked to an 
externally controllable RNA polymerase III promoter, wherein expression of the siRNA is 
regulated by a polypeptide regulator having both a DNA binding domain and a repressor domain. 
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While it does appear to teach a DNA binding domain, such as is found in the TetR, we can not 
find any suggestion to use a polypeptide regulator having both a DNA binding domain and a 
repressor domain such as TetR/KRAB. Furthermore, this reference teaches away from using 
such a polypeptide regulator for inhibiting RNA polymerase III. ("...these factors carry protein 
domains that transactivate or transrepress the RNA polymerase //." Col 33, line 19-21 (emphasis 
ours)). 

For the foregoing reasons, the Examiner has still failed to make out a proper prima facie 
obviousness rejection for the same reasons as set forth in the previous section of this brief. 

VIII. CONCLUSION 

Appellants believe that the foregoing remarks fully respond to all outstanding matters for 
this application. Appellants respectfully request that the Board reverse the rejections of all 
claims. ^ \ 
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I. CLAIMS APPENDIX 



1. (Previously presented) A polynucleotide construct comprising a region encoding a 
siRNA operably linked to an externally controllable RNA polymerase III promoter, wherein 
expression of the siRNA is regulated by a polypeptide regulator having both a DNA binding 
domain and a repressor domain. 

2. (Original) The construct of claim 1, further defined as a vector. 

3. (Original) The construct of claim 2, wherein the vector is a lentiviral vector, a retroviral 
vector, an MLV vector, an AAV vector, a plasmid vector or an adenoviral vector. 

4. (Original) The construct of claim 1, wherein the externally controllable promoter is a 
repressible promoter whereby expression of the encoded siRNA can be downregulated by means 
of an externally applied agent. 

5. (Original) The construct of claim 4, wherein expression of the encoded siRNA can be 
downregulated by means of an externally applied drug. 

6. (Previously presented) The construct of claim 1, wherein the externally controllable 
promoter is a repressible promoter that is regulated by a Tet repressor which further comprises a 
DNA binding domain. 
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7. (Previously presented) The construct of claim 1, wherein the externally controllable 
promoter is a repressible promoter comprising at least one tetO sequence. 

8. (Withdrawn) The construct of claim 1, wherein the repressible promoter is regulated by 
the lad repressor. 

9. (Previously presented) The construct of claim 1, wherein the externally controllable 
promoter is a repressible promoter from the gene of ANB1, HEM 13, ERG 11, OLE 1, GAL1, 
GAL 10, ADH2, or TET R . 

10. (Original) The construct of claim 1, wherein the externally controllable promoter is an 
inducible promoter whereby expression of the encoded siRNA can be upregulated by means of 
an externally applied agent. 

11. (Original) The construct of claim 10, wherein the inducible promoter is inducible by 
Cu +2 , Zn 2+ , tetracycline, tetracycline analog, ecdysone, glucocorticoid, tamoxifen, or an inducer 
of the lac operon. 

12. (Withdrawn) The construct of claim 1 1, wherein said promoter is inducible by ecdysone, 
glucocorticoid, or tamoxifen. 



13. (Original) The construct of claim 10, wherein said inducible promoter is a phage 
inducible promoter, nutrient inducible promoter, temperature inducible promoter, radiation 
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inducible promoter, metal inducible promoter, hormone inducible promoter, steroid inducible 
promoter, antibiotic inducible promoter, or combination thereof. 

14. (Withdrawn) The construct of claim 13, wherein said radiation inducible promoter is a 
fos promoter, a jun promoter, or an erg promoter. 

15. (Withdrawn) A system for controlling gene expression comprising: 

(a) one or more polynucleotide constructs in accordance with claim 1 ; 

(b) a compound or reagent that may be administered to the cell that directly or 
indirectly controls the expression of the siRNA. 

16. (Withdrawn) The system of claim 15, further defined as comprising: 

a) a regulatable siRNA-expression construct comprising an siRNA encoding nucleic 
acid segment potentially under the control of a regulatable promoter region; and 

b) a polynucleotide encoding a regulatable polypeptide regulator of the regulatable 
siRNA-expression construct. 

17. (Withdrawn) The system of claim 15, wherein an excisable fragment is located between 
the segment and the regulatable promoter region. 

1 8. (Withdrawn) The system of claim 17, wherein the expression of an agent that excises 
said excisable fragment is under the control of a tissue-specific promoter. 
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19. (Withdrawn) The system of claim 16, wherein the regulatable siRNA-expression 
construct is an integrating viral vector. 

20. (Withdrawn) The system of claim 1 8, wherein the integrating viral vector is a lentivirus, 
a retrovirus, or an adeno-associated virus. 

21. (Withdrawn) The system of claim 19, wherein the integrating viral vector is a lentivirus. 

22. (Withdrawn) The system of claim 16, wherein the regulatable siRNA-expression 
construct comprises a marker to assay for integration. 

23. (Withdrawn) The system of claim 16, wherein the regulatable promoter region comprises 
a nucleic acid segment that negatively regulates transcription from the promoter. 

24. (Withdrawn) The system of claim 16, wherein the nucleic acid segment that negatively 
regulates transcription from the promoter is a let operator. 

25. (Withdrawn) The system of claim 22, wherein the regulatable polypeptide regulator is 
capable of binding the tet operator. 

26. (Withdrawn) The system of claim 18, wherein the tissue specific promoter is liver fatty 
acid binding (FAB) protein gene promoter, insulin gene promoter, transphyretin promoter, al- 
antitrypsin promoter, plasminogen activator inhibitor type 1 (PAI-1) promoter, apolipoprotein AI 
promoter, LDL receptor gene promoter, myelin basic protein (MBP) gene promoter, glial 
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fibrillary acidic protein (GFAP) gene promoter, opsin promoter, LCK promoter, CD4 promoter, 
keratin promoter, myoglobulin promoter, or neural-specific enolase (NSE) promoter. 

27. (Withdrawn) The system of claim 16, wherein the polynucleotide is under the control of 
a constitutive promoter. 

28. (Withdrawn) The system of claim 16, wherein the polynucleotide is comprised in a viral 
vector. 

29. (Withdrawn) The system of claim 27, wherein the viral vector is an integrating viral 
vector. 

30. (Withdrawn) The system of claim 28, wherein the integrating viral vector is a lentivirus, 
retrovirus, or adeno-associated virus. 

3 1 . (Withdrawn) The system of claim 16, further defined as comprising: 

(a) a polynucleotide construct comprising a promoter operably linked to at least one 
polynucleotide encoding siRNAs; 

(b) a polynucleotide encoding an inducible repressor that can repress the expression 
of said at least one siRNA; 

(c) one or more vectors comprising the constructs of (a) and (b); and 
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(e) a compound or reagent that may be administered to the cell that controls the 
expression of the flision protein. 



32. (Withdrawn) The system of claim 31, further defined as comprising: 

(a) a polynucleotide construct comprising a polymerase Ill-dependent promoter 
operably linked to at least one polynucleotide encoding siRNAs; 

(b) a polynucleotide encoding a drug-inducible repressor fusion protein that 
comprises a DNA binding domain and a transcription repression domain; and 

(c) a polynucleotide bindable by the binding domain of the fusion protein of (b) and 
positioned such that the transcription repression domain acts to repress 
transcription of the polynucleotide construct of (a); 

(d) one or more vectors comprising the constructs of (a), (b), and (c); and 

(e) a compound that may be administered to the cell that controls the expression of 
the fusion protein or that controls the binding of the fusion protein to the 
polynucleotide sequence bindable by the binding domain of the fusion protein. 



33. (Withdrawn) The system of claim 32 5 wherein the polynucleotide encoding the fusion 
protein is operatively linked to an inducible promoter. 



34. (Withdrawn) The system of claim 32, wherein the promoter is a constitutive promoter. 
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35. (Withdrawn) The system of claim 32, wherein the polynucleotide sequence bindable by 
the binding domain of the fusion protein is the tetracycline operator {tetO) sequence, the fusion 
protein of (ii) is comprised of the DNA binding domain of the tetracycline repressor (tTR) fused 
to the KRAB repression domain of human Kox-1 (tTR-KRAB), and the substance of (c) is 
doxycycline. 

36. (Withdrawn) The system of claim 32, wherein the vector of (b) is a lentiviral vector, an 
MLV vector, an AAV vector, a plasmid vector or an adenoviral (Adv or Ad) vector. 

37. (Withdrawn) The system of claim 35, wherein the polynucleotide sequence bindable by 
the binding domain of the fusion protein of (b), the promoter operably linked to the 
polynucleotide sequence encoding the siRNA and the polynucleotide sequence encoding the 
siRNA are comprised in the U3 region of the 3' long terminal repeat of the lentiviral vector. 

38. (Withdrawn) The system of claim 32, wherein the polynucleotide encoding the fusion 
protein is comprised within a second, separate vector from the vector comprising the constructs 
of (a). 

39. (Withdrawn) The system of claim 38, wherein the second vector comprising the 
polynucleotide encoding the fusion protein is a lentiviral vector, a MLV vector, an AAV vector, 
a plasmid vector or an adenoviral (Adv or Ad) vector. 



40. (Withdrawn) The system of claim 15, wherein the polynucleotide encodes siRNA that 
forms a stem-and-loop structure, or a hairpin. 

25711864.1 .24- 



41. (Previously presented) A mammalian cell comprising a polynucleotide construct 
comprising a region encoding a siRNA operably linked to an externally controllable RNA 
polymerase III promoter, wherein expression of the siRNA is regulated by a polypeptide 
regulator having both a DNA binding domain and a repressor domain, said polypeptide regulator 
being encoded by said cell. 

42. (Withdrawn) The mammalian cell of claim 41 , further defined as comprising: 

(a) a first polynucleotide sequence comprising a promoter operably linked to at least 
one nucleic acid segment encoding an siRNA; 

(b) a second polynucleotide sequence encoding a conditional repressor fusion protein 
that comprises a DNA binding domain and a transcription repression domain; and 

(c) a third polynucleotide sequence bindable by the binding domain of the fusion 
protein of (b) and positioned such that the transcription repression domain acts to 
repress transcription of the nucleic acid segment of (a). 

43. (Withdrawn) The mammalian cell of claim 41, wherein the conditional repressor fusion 
protein is drug inducible. 

44. (Withdrawn) The mammalian cell of claim 42, further comprising 
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(d) a fourth polynucleotide sequence, wherein the fourth polynucleotide sequence is 
excisable and prevents transcription from the polymerase Ill-dependent promoter; 
and, 

(e) a fifth polynucleotide sequence encoding an enzyme capable of excising the 
fourth polynucleotide sequence, wherein the fifth polynucleotide sequence is 
under the control of a regulatable promoter. 

45. (Withdrawn) The mammalian cell of claim 44, wherein the third polynucleotide 
sequence bindable by the binding domain of the fusion protein is the tetracycline operator (tetO) 
sequence, the fusion protein of (b) is comprised of the DNA binding domain of the tetracycline 
repressor (tTR) fused to the KRAB repression domain of human Kox-1 (tTR-KRAB), and the 
fusion protein is controlled by doxycycline. 

46. (Original) The mammalian cell of claim 41, wherein the cell is an undifferentiated cell. 

47. (Original) The mammalian cell of claim 41, wherein the cell is an oocyte or fertilized 
oocyte. 

48. (Withdrawn) A transgenic animal having one or more cells as defined by any one of 
claim 41. 



49. (Withdrawn) A method of creating a transgenic animal capable of exhibiting conditional 
knockdown of a target gene comprising: 
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a) introducing into a sex cell or an undifferentiated embryonic cell an expression 
construct comprising a polynucleotide in accordance with claim 1 ; 

b) fertilizing the cell to create an embryo if the cell is a sex cell; and, 

c) transplanting the embryo into a female animal, wherein the female animal 
produces a transgenic animal. 

50. (Withdrawn) The method of claim 49, wherein the sex cell or undifferentiated embryonic 
cell is an unfertilized oocyte, a fertilized oocyte, an embryonic stem cell, a cell within a morula 
or blastocyst. 

51 . (Withdrawn) The method of claim 49, further comprising culturing the cell prior to 
introduction of the expression construct and/or transplantation. 

52. (Withdrawn) A method of regulating the expression of a gene in a cell, the method 
comprising the steps of: 

(a) preparing a polynucleotide construct in accordance with claim 1, wherein the 
siRNA encoded by said construct downregulates the expression of said gene; 

(b) externally regulating the expression of the encoded siRNA through the externally 
controllable promoter. 
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53. (Withdrawn) The method of claim 52, wherein the externally controllable promoter is a 
repressible promoter whereby expression of the encoded siRNA can be downregulated by means 
of an externally applied agent. 

54. (Withdrawn) The method of claim 53, wherein expression of the encoded siRNA can be 
downregulated by means of an externally applied drug. 

55. (Withdrawn) The method of claim 53, wherein the repressible promoter comprises at 
least one tetO sequence. 

56. (Withdrawn) The method of claim 52, wherein said method further comprises the step of 
providing a polynucleotide encoding an inducible repressor that can repress the expression of the 
siRNA. 

57. (Withdrawn) The method of claim 56, wherein the polynucleotide encoding the repressor 
is further defined as a drug-controllable repressor fusion protein that comprises a DNA binding 
domain and a transcription repression domain, and wherein the binding domain of the fusion 
protein can bind the polynucleotide construct such that the trancription repression domain acts to 
repress transcription of the siRNA. 

58. (Withdrawn) The method of claim 52, wherein the promoter regulating expression of the 
siRNA is a constitutive promoter. 
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59. (Withdrawn) The method of claim 52, wherein the promoter regulating expression of the 
siRNA is a tissue-specific promoter. 

60. (Withdrawn) The method of claim 52, wherein said cell is further defined as being in a 
cell line. 

61. (Withdrawn) The method of claim 59, wherein said method further comprises the step of 
providing to said cell a drug for testing drug function in the absence of a gene product encoded 
by said gene. 

62. (Withdrawn) The method of claim 52, wherein said cell is comprised in an animal. 

63. (Withdrawn) The method of claim 56, wherein the polynucleotide construct encoding the 
siRNA is further defined as being administered to a region of said animal, wherein the animal 
comprises said polynucleotide encoding the repressor. 

64. (Withdrawn) The method of claim 63, wherein said administering is by injection. 

65. (Withdrawn) The method of claim 63, wherein said region of the animal is in an organ. 

66. (Withdrawn) The method of claim 62, wherein the animal is a human patient. 

67. (Withdrawn) The method of claim 66, wherein the gene is an oncogene. 
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68. (Withdrawn) The method of claim 67, wherein the oncogene is ABLI, BLC1, BCL6, 
CBFA1, CBL, CSFIR, ERBA, ERBB, EBRB2, ETS1, ETS1, ETV6, FGR, FOX, FYN, HCR, 
HRAS, JUN, KRAS, LCK, LYN, MDM2, MLL, MYB, MYC, MYCL1, MYCN, NRAS, PIM1, 
PML, RET, SRC, TALI, TCL3 or YES. 

69. (Withdrawn) The method of claim 52, wherein externally regulating the expression of 
the encoded siRNA comprises administering an agent to the patient that effects an upregulation 
or downregulation of said expression. 

70. (Withdrawn) The method of claim 69, wherein the agent is an antibiotic, radiation, 
steroid, hormone, metal, divalent cation, nutrient, or temperature. 

71 . (Withdrawn) The method of claim 70, wherein the radiation is ionizing radiation. 

72. (Withdrawn) A method of controlling the ability of a cell to be recognized 
immunologically, comprising the steps of: 

(a) obtaining a cell comprising: 

(i) a polynucleotide construct in accordance with claim 1, wherein the siRNA 
encoded by said construct downregulates the expression of a 
polynucleotide encoding a transplantation antigen; and 

(ii) a polynucleotide encoding an inducible repressor that can repress the 
expression of said siRNA; and 
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(b) externally regulating the expression of the encoded siRNA through the externally 
controllable promoter. 

73. (Withdrawn) The method of claim 72, wherein said cell is in an animal. 

74. (Withdrawn) The method of claim 72, wherein the transplantation antigen is an MHC I 
antigen. 

75. (Withdrawn) The method of claim 72, wherein said transplantation antigen is beta2- 
microglobulin, an HLA, H-Y, P35B, Kdm4, Kdm5, TL, P198, P91 A, or H-2Kb. 

76. (Withdrawn) The method of claim 75, wherein said HLA antigen is HLA-C, HLA-G, or 
HLA-DQ. 

77. (Withdrawn) The method of claim 74, wherein said cell is an islet cell, a stem cell, a 
hepatocyte, a dopaminergic neuron, or a keratinocyte. 

78. (Withdrawn) A method of treating a disease condition amenable to treatment with an 
siRNA, the method comprising the steps of: 

(a) preparing a polynucleotide construct in accordance with claim 1 , wherein the 
siRNA encoded by said construct is for the treatment of the disease condition; 
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(b) externally regulating the expression of said siRNA through said externally 
controllable promoter. 



79. (Withdrawn) The method of claim 78, wherein the disease is a hyperproliferative 
disorder. 

80. (Withdrawn) The method of claim 79, wherein the hyperproliferative disorder is cancer. 

81. (Withdrawn) The method of claim 80, wherein the cancer is gliosarcoma, breast cancer, 
lung cancer, brain cancer, melanoma, prostate cancer, ovarian cancer, pancreatic cancer, liver 
cancer, colon cancer, cervical cancer, bladder cancer, spleen cancer, head and neck cancer, or 
bone cancer. 

82. (Withdrawn) The method of claim 78, wherein the disease condition is hyperthyroidism. 

83. (Withdrawn) The method of claim 78, wherein said disease condition is associated with 
a hypersecretion defect. 

84. (Withdrawn) The method of claim 83, wherein the hypersecretion defect comprises a 
hormone hypersecretion defect. 
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(57) Abstract 



The present invention is directed to DNA constructs suitable for gene expression in mammalian cells and which are characterized by 
the presence of a mammalian promoter under the control of a tet operatcnf repressor system. The DNA may be used as part of a system for 
expressing recombinant protein. In addition, the tet operator/represser system can be used to engineer cis- and trans-destructive viruses 
which are capable of replicating in the presence of the tet repressor, but not in the absence of the repressor. These viruses can be used 
either directly in the treatment of patients with corresponding viral diseases, as vehicles for the delivery of nucleic acids that can serve as 
therapeutic agents and as part of vaccines designed to immunize people or animals against viral diseases. 



11/30/05, EAST Version: 2.0.1.4 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Annenia 


Fl 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


Prance 


LU 


Luxembourg 


SN 


Senegal 


All 


Australia 


GA 


Oabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


EL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


a 


Cote d'lvoire 


BP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


FL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Sf > ynV'i|iiti 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


U 




SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







11/30/05, EAST Version: 2.0.1.4 



WO 99/00510 



PCT/US98/10907 



REGULATION OF TRANSCRIPTION IN MAMMALIAN CELLS AND VIRAL REPLICATION BY A TETRACYCLIN 
. REPRESSOR 

Field of the Invention 

The present invention is concerned with compositions and methods that rely upon the 
tetracycline resistance (tet) operator and repressor to control transcription in mammalian cells. 
It encompasses methods for recombinantly producing proteins and the vectors and host cells 
utilized in such methods. In addition, the present invention is directed to viruses which are 
recombinantly engineered so that their replication is controlled by the tet operator/repressor 
system. These viruses may serve as vehicles for gene transfer both in vitro and in vivo; as agents 
for immunization; and as a means for delivering nucleic acid therapeutic agents to cells. 

Background of the Invention 

The ability to specifically regulate transgene expression has been a central concern in 
molecular biology for many years. In the case of mammalian cells, the in vitro regulation of 
recombinant genes has most often been accomplished through the use of inducible promoters 
that respond to agents such as heavy metal ions (Brinster, et al y Nature 296:39-42 (1982); heat 
shock (Nover, in Heat Shock Response , pp. 167-220, CRC, Fla. (1991)); and hormones (Klock, 
et al. y Nature 329: 734-736 (1987)). Unfortunately, these promoters generally provide only a 
relatively a low level of expression even in the presence of inducer and most of the inducers 
that have been used in vitro have unacceptable side effects in vivo. 

As an alternative to inducible promoters, attempts have been made to control mammalian 
gene expression using well-characterized prokaryotic regulatory elements. In most cases, 
regulatory systems have relied upon strong interactions between prokaryotic operators and 
repressor proteins as a means for either targeting eukaryotic transcription modulators to specific 
sites within a host cell genome (see e.g., Labow, etai, Mol Cell Biol. 70:3343-3356 (1990)) 
or in attempts to directly inhibit gene expression using the prokaryotic repressor {see e.g., 
Brown, et a/., Cell 49:603-612 (1987)). 

In the case of prokaryotic elements associated with the tetracycline resistance (tet) operon, 
systems have been developed in which the tet repressor protein is fused with polypeptides 
known to modulate transcription in mammalian cells. The fusion protein has then been directed 
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to specific sites by the positioning of the tet operator sequence. For example, the tet repressor 
has been fused to a transactivator (VP16) and targeted to a tet operator sequence positioned 
upstream from the promoter of a selected gene (Gussen, et al., Proc. Nat'l Acad Sci. USA 
50:5547-5551 (1992); Kim, et al.,J. Virol. 69:2565-2573 (1995); Hennighausen, et al.,J. Cell 
Biochem. 50:463-472 (1995)). The tet repressor portion of the fusion protein binds to the 
operator thereby targeting the VP16 activator to the specific site where the induction of 
transcription is desired. An alternative approach has been to fuse the tet repressor to the KRAB 
repressor domain and target this protein to an operator placed several hundred base pairs 
upstream of a gene. Using this system, it has been found that the chimeric protein, but not the 
tet repressor alone, is capable of producing a 10 to 15-fold suppression of CMV-regulated gene 
expression (Deuschle, etaU Mol. Cell Biol. 75:1907-1914(1995)). The main problem with 
these types of systems is that the portion of fusion proteins corresponding to the mammalian 
transactivator or repressor tends to interact with cellular transcriptional factors and cause 
pleiotropic effects. 

Ideally, a system for regulating mammalian gene expression should be highly specific for 
a selected gene and subject to induction by factors suitable for use both in vitro and in yivo. 
The present invention discloses such a system and describes how it can be used to regulate 
transgene expression. In addition, the invention describes how this system can be adapted to 
engineer viruses to serve as vectors, therapeutic agents and vaccines. 

Summary of the Invention 

The present invention is directed to a number of different compositions and methods which 
share the common feature of having gene expression regulated by the tet operator/repressor 
system. 



A. Compositions and Methods for the Production of Recombinant Protein 

In its first aspect, the invention is directed to a recombinant DNA molecule which contains 
a mammalian promoter sequence with a TATA element; at least one tet operator sequence; and 
a gene sequence operably linked to the promoter and lying downstream from the operator. The 
exact positioning of the operator sequence (or sequences) relative to the TATA element is 
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critical to the invention. In order to be effective at controlling transcription, the operator must 
begin at least 6 nucleotides downstream from the last nucleotide in the TATA element and, 
when a gene encoding a protein is expressed, the operator should be positioned before the 
translation initiation codon. In general, the operator should not begin more than about 
100 nucleotides downstream and, preferably, it should begin within 6 to 24 nucleotides 
downstream of the TATA element When positioned in this manner, it has been found that the 
binding of the repressor protein causes an essentially complete shutdown in transcriptional 
activity. This is true even for very strong and highly promiscuous promoters such as the human 
CMV immediate early promoter. 

It is expected that the recombinant DNA molecule described above will, most typically, be 
incorporated into mammalian cells that constitutively express the tet repressor protein. Suitable 
cells may be developed by transforming a mammalian cell line, e.g., U20S cells or Vero cells, 
with a vector containing the tet repressor protein gene operably linked to a promoter active in 
the cells (e.g., a CMV promoter, HSV-1 promoter or SV40 promoter). Alternatively, the DNA 
molecule may contain, in addition to the elements already discussed, a second promoter, 
preferably constitutive, operably linked to the tet repressor gene sequence. The invention 
encompasses, not only the DNA molecules, but also the host cells transformed with the DNAs 
and the recombinant proteins made by the cells. 

The present invention is also directed to a method for recombinantly producing protein in 
which mammalian host cells are transformed with a vector containing a mammalian promoter 
sequence having a TATA element; at least one tet operator sequence positioned at least 6 
nucleotides 3' to the TATA element; and a gene lying 3' to the operator and operably linked to 
the promoter. The gene 3' to the operator may encode an antisense nucleic acid that inhibits the 
expression of a selected gene, a therapeutically active agent (e.g. a tumor suppressor or a 
transdominant negative mutant polypeptide of a cellular protein), a protein of interest for 
experimental purposes or simply a protein whose isolation is desired. In all cases where the 
gene encodes a protein, the operator sequence will be positioned before the translation initiation 
codon of the gene. The transformed cells should constitutively express the repressor protein and 
recombinant gene expression may be induced in the cells by introducing tetracycline. Typically, 
the tet operator sequence will be located between 6 and 100 nucleotides (preferably between 
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6 and 24 nucleotides) 3' to the last nucleotide in the TATA element. The preferred promoter 
is the human CMV immediate-early promoter. It has been found that this system allows for the 
very tight regulation of gene expression, i.e., expression is essentially completely shut off until 
the inducer, tetracycline, becomes available. 

5 The method can be used to produce recombinant protein in cultured mammalian cells or 

in the cells of a transgenic or non-transgenic animal. When a transgenic animal is used for 
production, it will most typically be a mouse and it is necessary that the cells transformed with 
the vector described above be embryonic stem cells. The stem cells may be engineered to 
express the tetracycline repressor by transforming them with the repressor gene operably linked 

10 to a promoter prior to transformation with the tet operator and recombinant gene. Alternatively, 
the repressor gene can be incorporated into the same DNA construct as the tet operator and 
placed under the control of either the same promoter as the gene encoding the recombinant 
protein or under the control of a separate promoter. The transformed stem cells are incorporated 
into a blastocyst to form a chimeric embryo, which is implanted into a pseudopregnant animal. 

1 5 Embryos implanted in this manner are allowed to develop into viable offspring that are screened 
to identify heterozygous animals expressing the recombinant gene. The heterozygous animals 
are then bred to produce homozygous animals that make recombinant protein in response to the 
administration of tetracycline. 

The invention encompasses the transgenic animals made using this method and any 
20 transgenic animal that has integrated into its genome recombinant DNA containing a 
mammalian promoter sequence having a TATA element; at least one tet operator sequence 
positioned at least 6 nucleotides 3' to the TATA element; and a gene lying 3' to the operator 
and operably linked to the promoter. When the gene encodes a protein, the sequence of the 
operator will be positioned before the translation initiation codon of the gene. Typically, the tet 
25 operator sequence will be located between 6 and 100 nucleotides (preferably between 6 and 24 
nucleotides) 3' to the last nucleotide in the TATA element. The preferred promoter is the 
human CMV immediate-early promoter. In addition to the transgenic animals, the invention 
encompasses the recombinant proteins made by these animals. 
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B. Engineered Viruses and Their Uses 

One particularly important use of the tet operator/repressor expression system is in the 
making of viruses in which replication can be controlled. The essential characteristic of these 
viruses is that they contain within their genome at least three related elements: a recombinant 
promoter having a TATA element; at least one tet operator sequence positioned at least 6 
nucleotides 3' to the TATA element; and a gene operably linked to the promoter, which lies 
downstream from the operator and which inhibits viral replication when expressed. Typically, 
the tet operator sequence will be located between 6 and 100 nucleotides (preferably between 
6 and 24 nucleotides) 3' to the last nucleotide in the TATA element The gene lying 
downstream of the operator may act either by encoding a protein that inhibits viral replication 
or by forming a transcription product that inhibits viral replication through an antisense 
mechanism. When the gene encodes a protein, the tet operator sequence will be positioned 
upstream from the translation initiation codon. The engineered virus can be made and grown 
in cultured cells that constitutively express the tet repressor protein. Under these conditions, the 
gene that inhibits viral replication will be shut off, allowing large amounts of virus to be 
produced. Virus may then be collected, purified, and introduced into mammalian cells either 
in vitro or in vivo. Since mammalian cells do not normally make the tet repressor protein, the 
operator sequence will be unoccupied. As a result, the gene lying 3' to the tet operator is 
expressed and viral replication is prevented. 

Viruses engineered in the manner discussed above have a wide range of possible 
applications. First, the viruses can be used as a vehicle for delivering DNA, ( e.g., a gene) to 
mammalian cells. Under these circumstances, a second recombinant promoter will typically be 
incorporated into the viral genome and operably linked to the gene whose expression is desired. 
This second promoter may or may not, be followed by one or more tet operators lying between 
6 and 100 (preferably between 6 and 24) nucleotides downstream from a TATA element in the 
second recombinant promoter. After having delivered the DNA to the host cell, production of 
new virus is inhibited due to the absence of the tet repressor protein. The gene attached to the 
second promoter may encode an antisense nucleic acid that inhibits the expression of a selected 
gene within cells; a therapeutically active protein (e.g. , a tumor suppressor or a transdominant 
negative mutant polypeptide of a cellular protein); or simply a protein that will be isolated or 
that is of interest for experimental reasons. The invention encompasses the method of 
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transforming host cells by transfecting them with the virus, the transformed host cells 
themselves and the recombinant proteins made by the host cells. 

The viruses discussed above may also be used to immunize subjects. The great advantage 
of vaccines containing the engineered viruses that, because the viruses will not replicate after 
5 they arc injected into subjects, the risk of active viral infection due to immunization is greatly 
reduced. To further ensure that virus replication will not occur, additional mutations may be 
introduced into the viruses, e.g. a deletion mutation may be introduced into one or more 
essential viral genes. In general, viruses containing such additional mutations will be preferred. 

The engineered viruses also have utility in the direct treatment of patients for viral 
infections. The first step in this method involves transforming a second virus (i.e., a virus other 
than the one that has infected the patient although possibly of the same strain) by incorporating 
into its genome: DNA comprising a mammalian promoter with a TATA element; at least one 
tet operator sequence positioned at least 6 nucleotides 3' to the TATA element (typically 
between 6 and 100 nucleotides 3' to the TATA element); and a gene positioned 3' to the 
operator and operably linked to the promoter. This gene should be chosen so that, when 
expressed, it is capable of blocking the replication of both the second virus and the virus which 
has infected the patient. In cases where the gene encodes a protein, the sequence of the tet 
operator will be positioned before the translation initiation codon of the gene. The transformed 
second virus is grown in host cells expressing the tet repressor protein, thereby allowing large 
amounts of viral progeny to be produced. Virus is collected, purified and then administered to 
the patient. In preferred embodiments, the tet operator is located between 6 and 24 nucleotides 
downstream from the last nucleotide in the TATA box and the promoter used is the human 
CMV immediate-early promoter. 

Finally, the present invention is directed to a method for delivering a nucleic acid 
25 therapeutic agent to cells. The nucleic acid therapeutic agent may comprise either an antisense 
fragment that inhibits the expression of a cellular protein, or a gene that encodes a protein with 
a therapeutic action. The virus is engineered to contain within its genome: i) a recombinant 
mammalian promoter with a TATA element; at least one tet operator sequence positioned at 
least 6 nucleotides 3' to the TATA element and 5' to a translation initiation codon; and iii) a 
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gene positioned 3' to the operator and operably linked to the promoter. When this gene is 
expressed, viral replication is inhibited. Typically, the tet operator sequence will be located 
between 6 and 100 nucleotides (preferably between 6 and 24 nucleotides) 3' to the last 
nucleotide in the TATA element. The preferred promoter is the immediate-early promoter of 
5 human CMV. In addition, the virus must contain within its genome the nucleic acid encoding 
the therapeutic agent operably linked to a second promoter. This second promoter may, or may 
not, be followed by one or more tet operators lying, typically, between 6 and 100 (preferably 
between 6 and 24) nucleotides downstream from a TATA element in the second promoter. 

After the preparation of the viral vector for delivering therapeutic agent, the next step in the 
10 method is to grow a large amount of the virus in a host cell that expresses the tet repressor 
protein. The virus grown in this manner is collected, purified and then administered to the 
patient. Since the patient would not normally have cells synthesizing tet repressor, replication 
of virus will be blocked but transcription of the nucleic acid therapeutic agent will proceed. 

Brief Description of the Figures 

1 5 Figure 1 . Diagram of the hCMV major immediate-early enhancer-promoter and strategy 

for creating a tetR-responsive transcription switch. Panel A shows a DNA sequence containing 
two tandem tet operators used for generating the tet operator-bearing hCMV major immediate- 
early enhancer-promoter. Panel B shows DNA sequences surrounding the TATA element and 
cis-acting sequences known to interact with cellular transcription factors. A Sac I restriction site 

20 used for insertion of tet operator is underlined. 

Figure 2. Insertion of tet operator sequences immediately downstream of the TATA element 
converts the hCMV major immediate-early enhancer-promoter to a tetR-sensitive transcription 
switch. Vero cells were seeded at 3 x 10 s cells per 60 mm dish and, at 24 hours after seeding, 
cells were transfected with 0.5 jig of pWRG1630 or pCMVtetOEGF alone or in the presence 
25 of 1 |ig, 2 jig and 3 jig of the tet repressor-expressing plasmid, pcDNA3-tetR, either in the 
absence or presence of tetracycline at 1 |ig/ml. pUC19 vector plasmid was used to balance the 
pCDNA3-tetR and 3.5 \xg of plasmid DNA was used in the transfection assay. Extracellular 
medium was collected from transfected cells every 20-24 hours and fresh growth medium was 
added either with or without 1 jig/ml of tetracycline. hEGF in extracellular medium collected 
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from 0 to 20 hours (panel A), 20 to 44 hours (panel B), and 44 to 68 hours (panel C) was 
determined by ELIS A with the use ofanti-hEGF specific monoclonal and polyclonal antibodies. 

Figure 3. Release of tetR-mediated repression by tetracycline. Vero cells were transfected 
with either pCMVtetOEGF (0.5 pg) or pCMVtetOEGF (0.5 pg) and pcDNA3-tetR (2 pg) in 
the absence of tetracycline. 20 hours after transfection, extracellular medium was collected and 
fresh growth medium was added to the transfected cells in the absence or presence of 
tetracycline at 0.1 pg/ml or 1 pg/ml for an additional 24 hours. hEGF expression in 
extracellular medium collected from 0 to 20 hours (panel A) and 20 to 44 hours (panel B) was 
determined by ELISA. Two independent experiments are shown for each indicated co- 
transfection assay. 

Figure 4. Determination of the efficacy of tetR-mediated cumulative regulation of transgene 
expression using luciferase as a reporter. Vero cells in 60 mm dishes were transfected with 
0.5 pg pCMVtetOGL2 alone or 0.5 pg of pCMVtetOGL2 together with 2 pg of pCDN3-tetR 
in the absence of tetracycline from 0 to 20 hours and in the absence or presence of 1 pg/ml of 
tetracycline from 20 to 70 hours. At 70 hours after transfection, Vero cells were lysed in 0.5 ml 
of Ix luciferase lysis buffer in the presence of 0.2 mM of PMSF, 100 pg/ml of TPCK and 
1 mM of leupeptin for 15 minutes at room temperature. Insoluble cellular debris was removed 
by centrifugation in a microcentrifuge for 20 minutes at 4°C and luciferase activity was then 
measured as mV per 10 pg of protein. 

Figure 5. In vivo regulation of the hCMV major immediate-early enhancer-promoter by the 
tet repressor. A total of 1 8 partial thickness wounds (15x15x15x1.2 mM) were created on 
porcine dorsal skin. Nine wounds received 0.2 pg of pCMVtetOEGF and 0.8 pg of pcDNA3 
vector DNA and the others received 0.2 pg of pCMVtetOEGF and 0.8 pg of pcDNA3-tetR by 
particle-mediated gene transfer. After particle bombardment, each transfected wound was 
enclosed in a sealed vinyl adhesive chamber containing 1 .2 ml of isotonic saline in the presence 
of 100 units/ml penicillin and 100 pg/ml streptomycin. Wound fluid was collected from the 
chambers at 22, 46 and 70 hours after gene transfer and stored at -70°C. Following collection 
of wound fluid at each indicated time point, a new chamber was applied. Each pig was given 
500 mg of tetracycline by intravenous injection at 46 hours after gene transfer. hEGF expression 
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in wound fluid was determined by ELISA. The vertical line associated with each bar represents 
standard error. 

Figure 6. Inhibition of HSV-l replication by the transdominant negative form of the UL9 
peptide and reversibility using the tet repressor. Vero cells were seeded at 5 x 10 5 cells per 
60 mm. At 20 to 24 hours after seeding, the cells were transfected with 0, 1 ng of purified 
infectious HSV-l DNA either alone or in the presence of 0. 1 fig of either pCMVtetOUL9-C57 1 
or pCMVtetOUL9-nl0/C535. Transfections were carried out in the presence of 1.5 \ig of 
pCDNA3 vector DNA or the tet repressor-expressing plasmid, pCDNA3-tetR. Fourteen hours 
after transfection, medium was removed followed by the addition of methylcellulose to the 
transfected cells at 10 ml per dish. Viral plaques were visualized by staining transfected cells 
with neutral red at 68 to 72 hours post-transfection and plates were counted 14 hours later. 

Figure 7. Reversibility of HSV-l replication inhibition using tetracycline. Vero cells were 
transfected with three different sets of DNA vectors: 1) 0.2 fig of infectious HSV-l DNA and 
2.1 fig of pCDNA3; 2) 0.2 fig of infectious HSV-l DNA, 0.1 fig of pCMVtetOUL9-C571 and 
2 ng of pCDNA3; and 3) 0.2 ng of infectious HSV-l DNA, 0.1 fig of pCMVtetOUL9-C571 
and 2 fig of pCDNA3-tetR. Transfections were carried out either in the presence or absence 
of tetracycline at I jig/ml. Sixteen hours after transfection, medium was removed from cells 
and 5 ml of fresh medium was added to each dish either with or without tetracycline at a 
concentration of 5 fig/ml. At 48 hours after transfection, cells were harvested and virus yields 
were determined The results of this determination are shown in the figure. - 

Definitions 

The description that follows uses a number of terms that refer to recombinant DNA 
technology. In order to provide a clear and consistent understanding of the specification and 
claims, including the scope be given such terms, the following definitions are provided. 

Viral vector : As used herein, "viral vector" and equivalent terms refer to viruses that are 
utilized for transferring selected DNA or RNA sequences into a host cell. The vectors maybe 
utilized for the purpose of transferring DNA into cells either in vitro or in vivo. Viruses that 
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have been commonly used for the latter purpose include the retroviruses, adenoviruses, 
parvoviruses and herpes viruses. 

Expression vector : This and comparable terms refer to a vector which is capable of 
inducing the expression of DNA that has been cloned into it after transformation into a host 
cell. The cloned DNA is usually placed under the control of (i.e., operably linked to) certain 
regulatory sequences such a promoters or enhancers. Promoters sequences maybe constitutive, 
inducible or repressible. 

Substantially pure or purified : As used herein, "substantially pure" or "purified" means that 
the desired product is essentially free from contaminating cellular components. Containments 
may include, but are not limited to, proteins, carbohydrates and lipids. One method for 
determining the purity of a protein or nucleic acid is by electrophoresis in a matrix such as 
polyacrylamide or agarose. Purity is evidence by the appearance of a single band after staining. 

Host : Any prokaryotic or eukaryotic cell that is the recipient of a vector is the host for that 
vector. The term encompasses prokaryotic or eukaryotic cells that have been engineered to 
incorporated a gene in their genome. Cells that can serve as hosts are well known in the art as 
are techniques for cellular transformation (see e.g., Sambrook, et aL. Molecular Cloning: A 
laboratory Manual . 2nd ed., Cold Spring Harbor (1989)). 

Promotor : A DNA sequence that initiates the transcription of a gene. Promoters are 
typically found 5' to the gene and located proximal to the start codon. If a promotor is of the 
inducible type, then the rate of transcription increases in response to an inducing agent. 

Expression . Expression is the process by which a polypeptide is produced from DNA. The 
process involves the transcription of the gene into mRNA and the translation of this mRNA into 
a polypeptide. Depending on the context in which used, "expression" may refer to the 
production of RNA, protein or both. 

Recombinant : As used herein, the term "recombinant" refers to nucleic acid that is formed 
by experimentally recombining nucleic acid sequences and sequence elements. A recombinant 
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host would be any host receiving a recombinant nucleic acid and the term "recombinant 
protein" refers to protein produced by such a host. 

Operablv linked : The term "operably linked" refers to genetic elements that are joined in 
such a manner that enables them to carry out their normal functions. For example, a gene is 
operably linked to a promotor when its transcription is under the control of the promotor and 
such transcription produces the protein normally encoded by the gene. 

Nucleic acid therapeutic ag ent: This term refers to any nucleic acid sequence which 
directly, or indirectly, serves as a therapeutic agent. Typically, such agents will fall into two 
categories. The first category encompasses antisense nucleic acids that are designed to anneal 
to complementary sequences within the host cell, thereby inhibiting expression. Alternatively, 
the term may refer to nucleic acids that encode a therapeutic protein. 

Gene : As used herein, "gene" refers to the nucleic acid sequence that undergoes 
transcription as the result of promoter activity. A gene may code for a particular protein or, 
alternatively, code for an RNA sequence that is of interest in itself, e.g. because it acts as an 
antisense inhibitor. 

Mammalian promoter : The term "mammalian promoter 11 refers to promoters that are active 
in mammalian cells. Similarly, "prokaryotic promoter" refers to promoters active in prokaryotic 
cells. 

Essential viral gene : The term "essential viral gene" is defined as a gene that is necessary 
for viral replication. 

Essential cellular g ene: This refers to a gene that is necessary for cellular survival 

Detailed Description of the Invention 

The present invention is based upon the concept that is it possible to regulate mammalian 
gene expression using the tet operator and repressor protein. Provided that the operator is 
positioned at least 6 nucleotides downstream from the last nucleotide of the TATA element of 
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the promoter controlling expression, regulation can be accomplished without the needfo fuse 
the repressor protein to other mammalian transcription modulators. 

Although not critical, a knowledge of the basic functioning of the tetracycline resistance 
(tet) operon in bacteria may help in understanding the way in which the invention works. In the 
5 tet operon, a tetracycline resistance gene (tetA) and gene encoding the tet repressor protein 
(tetR) are both under the control of the same promotor and operator elements. In the absence 
of tetracycline, the tet repressor protein binds to the operator DNA sequence, thereby sterically 
preventing the adjacent promotor from interacting with RNA polymerase. Thus, transcription 
of both tetA and tetR are blocked. When the level of tetracycline within the bacterium 
10 increases, the tetracycline binds to the repressor protein causing it to detach from the operator 
sequence. As a result, the polymerase is able to bind to the promotor sequence and both the tetA 
and tetR genes are transcribed. 

The strong interaction between the tet repressor protein and the tet operator has provided 
a mechanism for targeting eukaryotic regulatory proteins to specific sites within the genome of 

15 a cell. As discussed above, previous systems have been described in which the tet operator is 
positioned upstream from a mammalian gene to serve as a target for fusion proteins comprised 
of the tet repressor and a mammalian transcription activator or repressor. The tet repressor 
portion of the fusion protein binds to the operator sequence, thereby positioning it upstream 
from the gene to be expressed. The remaining portion of the fusion protein then serves to 

20 modulate gene expression by interacting with cellular transcription factors. 

The main problem with these types of systems is that pleiotropic effects are caused by the 
interaction of the mammalian transcription modulator with transcriptional factors at sites 
distinct from the operator. Previous attempts to modulate gene expression using the tet 
repressor protein alone, (/.c, other than as a fusion protein) have been unsuccessful {see e.g., 
25 Kim, el a/., J. Virol 69:2565-2573 (1995); Deuschle, et al y Mol Cell Biol 75:1907-1914 
(1995)). It has now been discovered that successful modulation of gene expression using tetR 
alone can be accomplished by inserting one or more tet operators approximately 10 base pairs, 
a full DNA helix turn, downstream of the tet operator. Using this approach, it has been possible 
to tightly regulate transcription controlled by the hCMV major immediate-early enhancer- 
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promoter, one of the most potent and promiscuous eukaryotic elements. This can be done both 
in vitro and in vivo. 

I. The Tet Operator as a Transcriptional Switch 

In its first aspect, the present invention is directed to recombinant DNA molecules 
5 containing a mammalian promoter sequence with a TATA element. A tetracycline operator 
sequence is positioned at least 6 nucleotides 3' to the TATA element and is followed by a DNA 
sequence whose transcription is controlled by the promoter. Procedures for either synthesizing 
or purifying promoters, operators and other DNA sequences are well known in the art and 
standard techniques in molecular biology can be employed for constructing DNA molecules 
10 with appropriately arranged elements (see e.g., Sambrook et ai, Molecular Cloning : A 
Laboratory Manual. 2nd ed., Cold Spring Harbor Press (1989)). Examples of preferred 
methods are provided in the "Examples" section along with the complete sequence of the tet 
operator. 

Any type of promoter active in mammalian cells can be used in the invention including 
15 those that are inducible, repressible or constitutive. Preferred mammalian promoters include 
that of the mouse metallothionein I gene (Hamer, et ai J. Mol Appl Gen. 7:273-288 (1982)); 
the immediate-early and TK promotor of herpes virus (Yao et ai, J. Virol. tfP:6249-6258 
(1995); McKnight, Cell J7:355-365 (1982)); the SV 40 early promotor (Benoist, et al % Nature 
290:304-3 10 (1981)); and, especially, the human CMV immediate-early promotor (Boshart, et 
20 al. Cell 41 ;52 1 -530 ( 1 985)). Full length or minimal promoters may be used and other regulatory 
elements, (see e.g. Figure I) may be included. As discussed in the "Examples" section, the full 
human CMV major immediate-early enhancer-promotor has been successfully used in the 
invention and it will be understood that, unless otherwise specified, reference to the "human 
CMV immediate-early promoter" includes both the promoter per se, as well as the promoter 
25 in combination with any or all of the other transcriptional regulatory elements shown in Figure 
t. 

The promotor is separated from the sequence undergoing transcription by one or more tet 
operator sequences that begin at least 6 nucleotides downstream from the TATA element. 
Typically, the operator will begin at a position between 6 and 100 nucleotides (and preferably 
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between 6 and 24 nucleotides) downstream from the TATA element. The arrangement of these 
elements must not substantially interfere with the ability of the promoter to direct the 
transcription of the downstream sequence or the translation of the gene product. 

Typically, the DNA molecule described above will be incorporated into a vector (e.g.. a 
plasmid or virus) which contains other transcription or translational elements. If desired, large 
amounts of vector DNA can be generated, (e.g., but transferring the vector into bacteria that 
make the repressor protein). Preferably, the vector is then transferred into a mammalian host 
cell which has been engineered to express the tet repressor. One way to engineer mammalian 
cells to express the tet repressor is to operably link the repressor gene sequence to a second 
promoter, incorporate this into the vector containing the tet operator and then transfer the DNA 
into the cells. Alternatively, cells may be transformed with an expression vector containing the 
tet repressor sequence prior to the transfer of the construct containing the tet operator. An 
example of a plasmid that has been used to produce cells expressing the tet repressor is 
pcDNA3-tetR (see "Examples" section). 

Any method for introducing expression vectors into cells maybe used with the present 
invention including calcium phosphate precipitation, microinjection, electroporation, liposomal 
transfer, viral transfer or particle mediated gene transfer. When transfers are done to host cells 
in vivo, the preferred method of transformation is by means of a viral vector. Cells that have 
incorporated constructs can be identified using hybridization techniques well known in the art 
or by using the polymerase chain reaction (PCR) to amplify specific recombinant sequences. 
If the recombinant DNA transferred into the cells produces a protein that can be detected, e.g., 
by means of an immunological or enzymatic assay, then the presence of recombinant protein 
can be confirmed by introducing tetracycline into cells and then performing the assays either 
on the medium surrounding the cells or on cellular lysates. 

In the absence of tetracycline, host cells transformed with the constructs should not express 
substantial amounts of recombinant DNA. Expression of recombinant DNA sequences 
incorporated into hosts cells is induced using either tetracycline per se or a tetracycline 
analogue. The latter is defined as any compound which is related to tetracycline in the sense 
that it maintains the ability to bind with specificity to the tet repressor. The dissociation 



11/30/05, EAST Version: 2.0.1.4 



WO 99/00510 PCT/US98/I0907 

15 

constants of such analogues should be at least I x 1 (T 6 M and preferably greater than t x iO" 9 
M. Examples of analogues that can be used include, but are not limited to, those discussed by 
Hlavka, et al ("The Tetracyclines," in Handbook of Experimental Pharmacology 78, 
Blackwood, et al. (eds.), New York (1985)) and Mitschef ("The Chemistry of Tetracycline 
5 Antibiotics," Medicinal Res. 9, New York (1978)). Similarly, minor modifications in the 
sequence of the repressor or the operator will not affect the invention provided that such 
modifications do not substantially reduce either the affinity or specificity of the 
repressor/operator interaction. 

II. Method for Recombinant^ Producing Protein in Vitro and in Vivo 
10 The vectors and DNA constructs discussed above can be used as part of a method for 

recombinantly producing protein either in vitro or in vivo. In vitro, mammalian host cells are 
preferred for the production of protein and include U20S cells, Vero cells, NIH-3T3 cells, CHO 
cells, Hela cells, LM(tk-) cells, etc. Vectors suitable for use in each of these various cell types 
are well known in the art {see e.g., Sambrook, et al., supra). 

1 5 The DNA constructs may also be used to produce recombinant proteins in vivo using both 

transgenic and non-transgenic animals. Although production in any type of transgenic animal 
is compatible with the invention, it is expected that mice will be used in most cases. Typically, 
mouse embryonic stem (ES) cells will be transformed with the DNA constructs and then 
incorporated into a developing mouse embryo. Any ES cell line which has the ability to 

20 integrate into and become part of the germ line of the developing embryo may be used, e.g., the 
murine cell line D3 (ATCC, 12301 Parklawn Drive, Rockville, Md., catalog no. CR 1934). The 
cells are cultured and prepared for DNA insertion using methods well-known in the art {See, 
e.g., Robertson, in Teratocarcinomas and Embryonic Stem Cells: A Practical Approach , 
Robertson, ed., I.RX. Press Washington, D.C. (1987); Bradley, et al., Current Topics in Devel. 

25 Biol 20; 357-371 (1986); and Hogan, et al., Manipulating the Mouse Embrvo: A Laboratory 
Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, M.Y. (1986)). Stem cells 
will need to be engineered to express the tet repressor protein, and, as discussed above, this can 
be done either by incorporating the repressor gene into the same construct containing the tet 
operator or by separately transforming cells with a repressor gene-containing construct. 
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DNA can be incorporated into cells using any method known in the art, but most typically, 
this transfer will be accomplished using electroporation. If the DNA construct has been 
inserted into a plasmid-type vector, it is preferred that the DNA be linearized prior to 
transfection. Linearization can be accomplished by digesting the DNA vector with a suitable 
5 restriction endonuclease selected to cut outside of the DNA sequence to be expressed. The 
screening of transfected stem cells can be carried out using any of a variety of methods. For 
example, Southern hybridizations may be carried out using labeled probes that are specific to 
a sequence located within the DNA transferred into cells. Alternatively, PCR amplification can 
be used for selected sequences. 

10 After embryonic stem cells have been transformed and selected, the next step is to 

incorporate the cells into an embryo. The preferred method for accomplishing this is by 
microinjection of the stem cells into an embryo at the blastocyst stage of development. In mice, 
blastocysts at about 3.5 days of development may be obtained by perfusing the uterus of 
pregnant animals. Appropriate methods for carrying this out are well known in the art {see 

15 Bradley, in Teratocarcinomas and Embryonic Stem Cells: A Practical Approach. (1987)). 
Preferred blastocysts are male and have genes s for a phenotypic marker (e.g. coat color) that 
is different from the phenotypic marker encoded by the stem cell genes. In this way, offspring 
can be easily screened. 

The next step in the process of producing transgenic animals involves implanting the 
20 chimeric embryo into the uterus of a pseudopregnant animal. Such animals are typically 
prepared by mating females with vasectomized males of the same species. The pseudopregnant 
stage of the female is important for successful implantation and will vary from species to 
species. For mice, females about two to three days pseudopregnant should typically be used. 

After chimeric embryos have been implanted into pseudopregnant animals, they are allowed 
25 to develop to term and offspring are then screened. In cases where a phenotype selection 
strategy has been employed, initial screening may be accomplished by simple inspection of 
animals for mosaic coat color or for some other readily apparent phenotypic marker. In 
addition, or as an alternative, chromosomal DNA may be obtained from the tissue of offspring, 
e.g., from the tail tissue of mice, and screened for the presence of recombinant DNA using 
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Southern blots and/or PCR amplification. Homozygous transgenic animals may flien be 
produced by interbreeding heterozygotes and then used to provide a continual supply of animals 
that are capable of expressing recombinant DNA. Expression can be controlled by maintaining 
the animals in the absence of tetracycline until recombinant synthesis is desired. Under these 
conditions, the tet repressor protein will bind to the operator sequence thereby inhibiting the 
activity of the recombinant promoter. Tetracycline, or a tetracycline analog, once administered 
to animals will readily cross cell membranes and then cause the tet repressor protein to 
dissociate from the operator sequence. Thus, the recombinant gene downstream from the 
recombinant promoter will start being transcribed. 

Animals made in this manner, may be used for research purposes, e.g., to study the effects 
of various drugs or, alternatively, they may be used for the purpose of producing recombinant 
protein. In the latter case, it is preferred that the recombinant genes expressed in the cells be 
linked to a signal sequence that causes protein to be secreted into the blood of the animals. This 
may then be collected to serve as a source for the purification of recombinant protein. 

III. Recombinant^ Engineered Virus 

A. The Making of Recombinant Virus, Vaccines and Anti-viral Treatment 

The tet operator/repressor regulatory system described above can be used to engineer 
viruses in which the production of progeny is tightly regulated. This can be done by 
incorporating into the viral genome a construct containing a promoter (preferably the human 
CMV immediate-early promoter), the tet operator sequence at a position at least 6 nucleotides 
3' to the TATA element and a gene 3' to the operator and operably linked to the promoter. This 
gene inhibits viral replication when expressed and may take the form of an antisense sequence 
that binds to RN A encoding a protein necessary for viral replication or, alternatively, the gene 
may encode a protein that inhibits replication. In the latter case, the tet operator sequence will 
be positioned before the translation initiation codon of the gene. An example of a protein that 
will inhibit viral replication is the transdominant negative form of the UL9 protein of HSV-1 
which binds to the HSV-1 origin of replication and, when over- expressed, blocks new viruses 
from being formed. Similar proteins have been found to exist in many other viruses as well. 
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Viruses as described above can be generated in cells that constitutively produce" the tet 
repressor protein. Under these circumstances, the repressor will bind to the tet operator 
sequence and inhibit the expression of the gene downstream. Thus, viral inhibitory DNA 
sequences can be incorporated into the viral genome and large amounts of virus can be 
5 produced. For example, the repressor might block the synthesis of the mutant form of the UL9 
protein, thereby allowing the production of HSV-1. If desired, tetracycline or a tetracycline 
analog may be introduced into cells. The tetracycline will bind to the repressor protein and 
thereby cause it to dissociate from the operator sequence. Transcription of nucleic acid from 
the recombinant promoter would then proceed and viral replication would be inhibited. 

10 It should be noted that the system described above can be used both in vitro and in vivo. 

For example, large amount of virus can be grown by infecting cultured cells that make the tet 
repressor protein. The viruses can then be collected, purified and administered to a subject 
Once administered, the virus delivers its DNA to the cells within the subject but, because the 
tet repressor protein is not present, transcription of recombinant DNA within the viral genome 

15 proceeds and viral replication is inhibited. These characteristics, in themselves, make the 
engineered virus particularly attractive for use in immunization procedures and in the treatment 
of viral diseases. 

B. The Use of Engineered Virus in Immunization Procedures 

Most immunization procedures are carried out by exposing a subject to a particular disease- 
20 causing agent which has been modified so that it provokes an immunological response without 
actually causing the disease. For example, vaccines containing either dead or attenuated virus 
may be given to an individual to immunize them against polio. Viruses engineered using the 
tet operator/repressor system can be grown in large numbers in cultured cells making the tet 
repressor and then administered to patients as part of a vaccine. The patients thus treated would 
25 be exposed to the proteins normally present on the virus and will therefore mount an 
immunological response. However, because mammalian cells do not normally make the tet 
repressor, the virus will not be able to replicate and full-fledged exposure to the disease will be 
prevented. 
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In order to further ensure that the virus is not made, additional mutations can be introduced 
into the recombinant virus. For example, a deletion mutation may be introduced into an 
essential viral gene. The latter virus could be made and grown in cells expressing both tetR and 
the wild type form of the essential viral gene. 

This approach to immunization could be used for virtually all infectious viruses that have 
been isolated and could be applied both to the immunization of people as well as animals. 

C. The Use of Engineered Virus in the Treatment of Viral Diseases 
Viruses engineered using the tet operator/repressor system of the present invention can be 
used directly in the treatment of viral infections. For example, an HSV-1 virus could be 
engineered in the manner described above to contain within its genome a construct made up of 
a strong mammalian promoter, the tet operator sequence and the gene encoding the 
transdominant negative mutant form of UL9. The engineered HSV-l could be grown in large 
numbers in cultured cells expressing the tet repressor and then administered to patients 
suffering from an HSV-1 infection. The engineered virus would enter into the patient f s cells and 
express the transdominant negative mutant UL9 protein. This would serve to inhibit not only 
the replication of the engineered HSV-1 but also the HSV-1 that had originally infected the 
patient In effect, the engineered virus is serving as a vehicle for delivering antiviral agents in 
vivo. Because the engineered virus shares the same cellular specificity as the infecting virus, 
it is ideally suited for therapy. 

The animal and human viruses for which engineered virus could serve as either a vaccine 
or therapeutic agent include, without limitation, arboviruses; avian leukosis virus; CELO virus; 
Chagres virus; rhinoviruses; Coxsackie virus; hemorrhagic viruses; equine encephalomyelitis 
virus; hepatitis viruses; herpes viruses; infectious porcine encephalomyelitis virus; influenza 
viruses; Newcastle disease virus; papilloma virus; parainfluenza viruses; poliomyelitis virus; 
respiratory syncytial virus; Rous sarcoma virus; St. Louis encephalitis virus; dengue virus; 
Sendai virus; and rabies virus. 
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D. Engineered Viruses as Vectors for the Delivery of Nucleic Acid Therapeutics 

With minor modifications, the engineered viruses discussed above can be used for 
delivering any type of nucleic acid therapeutic agent to cells. These agents may take the form 
of either antisense nucleic acids that bind to complementary sequences to inhibit their 
expression as proteins, or as genes encoding proteins with a therapeutic action. 

The nucleic acid sequence that will be used as a therapeutic agent must be operably linked 
to a promoter which is active in the cells in which therapy is needed. This may either be the 
same promoter regulating the recombinant gene controlling viral replication or, alternatively, 
a second distinct promoter within the viral genome. The basic procedure to be followed in 
treating patients is essentially the same as that discussed above in connection with the use of 
engineered viruses for treating viral infections. Specifically, the virus engineered to contain 
nucleic acid therapeutic agent will be grown in cells that produce the tet repressor protein. 
Viruses made in this manner are collected, purified and administered to the subject in need of 
treatment. The engineered viruses then infect the subject's cells and, once inside, begin 
expressing both the nucleic acid inhibiting viral replication and the nucleic acid serving as a 
therapeutic agent. Although this system is ideally suited to gene therapy, it can also be utilized 
as a mechanism for delivering nucleic acids to cells in vitro, or as a means for attempting to 
engineer cells in vivo. For example, DNA constructs designed for homologous recombination 
to either replace defective counterparts or prevent abnormal gene expression may be delivered 
in this manner. 

As discussed above, additional mutations may be introduced into an essential viral gene in 
order to ensure that virus is not replicated. 

Examples 

Example 1 : Conversion of human CMV Major Immediate-early Enhancer-promoter 
to a Regulatory Switch Using the tet Repressor 

A. Materials and Methods 

Reporter and tet Expression Plasmids : Plasmid pWRG1630 is a human EGF expression 
plasmid in which a sequence coding for mature hEGF is controlled by the hCMV major 
immediate-early enhancer-promoter. There are two Sac I sites in pWRG1630 and one of these 
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Sac I sites is located three bases downstream of the TATA element of the hCM V major 
immediate-early promoter. To construct pCMVtetOEGF, the oligonucleotide: 

5' -CTCCCTATCAGTGATAGAGATCTCCCTATCAGTGATAGAGAT 

CQTCGACGAGCT -3' 

and its complementary sequence were annealed and purified by 15% polyacrylamide gel 
electrophoresis as previously described (Yao, et al. y J. Virol ^5:8158-8168 (1994)). The 
tetracycline (tet) operator sequence is shown in bold face (Heuer, et al J, Mol Biol. 202:407- 
415 (1988)) and the Sal I restriction enzyme site used for cloning analysis is underlined. The 
purified double stranded tet operator-containing fragment was then inserted at the Sac I site of 
the hCMV immediate-early promoter in plasmid pWRG1630 by partial digestion of 
pWRG1630 with Sac I. The insertion of a tetO sequence in pWRG1630 created a unique Sal 
I site and insertion of tetO in the hCMV immediate-early promoter created an Eco RI-Bam HI 
hCMV promoter-containing fragment of 701 base pairs. Figure 1 shows a schematic diagram 
of the tetO-containing hCMV immediate-early promoter in plasmid pCMVtetOEGF used in the 
study. 

pCMVGL2 and pCMVtetOGL2 are plasmids derived from a pGL2-basic vector (Promega, 
Madison, WI) in which the cDNA-encoding firefly luciferase is under the control of the wild- 
type hCMV promoter or the tetO-bearing hCMV promoter. To generate these two plasmids, the 
Eco RI-Bam HI hCMV promoter-containing fragment from pWRG1630 or the hCMV-tetO 
promoter-containing fragment from pCMVtetOEGF was inserted into the Sma I and Bgl II site 
of the pGL-basic vector. 

The tetracycline repressor expressing plasmid, pcDNA3-tetR, was constructed by first 
inserting the Bgl I-Sal I-tetR containing fragment of pSGStetR into the Xba I and Sal I site in 
pGEM3Z to generate pGEM3Z-tetIL The Sal I and Kpn I -tetR fragment of pGEM3Z-tetR was 
then cloned into the EcoR V-Kpn I site in the pcDNA3 vector. 
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Cell Culture and Transfection 

African green monkey kidney (Vero) cells were grown and maintained in Dulbecco's 
modified Eagle's medium (DMEM) supplemented with 10% fetal bovine serum. Cells were 
seeded at 2 to 3 x 10 5 cells per 60 mm dish. At 20 to 24 hours post-seeding, cells were 

5 transfected with either 0.5 jig of pWRG1630 or 0.5 \ig of pCMVtetOEGF in the presence of 
2 jig of pUC 19 vector DNA or 2 ng of pcDNA3-tetR by lipofectin-mediated transfection. The 
transfection was carried out in serum and antibiotic free DMEM for 16-20 hours followed by 
removal of the transfection medium and addition of 5 ml of normal growth medium in the 
presence or absence of tetracycline. The preparation of lipofectin-DNA complexes was carried 

10 out according to the procedure of the manufacturer (GIBCOBRL, Life Technologies) at 1 0 |il 
of lipofectin per 2.5 ng of plasmid DNA. 

For luciferase assays, Vero cells were seeded and transfected in a manner similar to that 
described above, with the exception of using 0.5 fig of pCMVtetOGL2 in the presence of 2 \ig 
of pUC19 vector DNA, or 2 fig of pcDNA3-tetR. At 20 hours after transfection, the lipofectin- 
15 plasmid DNA containing medium was removed and cells were re-fed with normal growth 
medium in the presence or absence of 1 jxg/ml of tetracycline. Cells were harvested at 70-72 
hours post-transfection and cell extracts were prepared according to the protocol described by 
the manufactured (Promega). 

Particle-Mediated Gene Transfer: 
20 Pigs used for in vivo gene transfer were domestic female Yorkshire pigs, 3 to 4 months old 

and weighing 40-45 kg. Partial thickness wounds (15X15X1. 2 mm) were made on porcine 
dorsal skin with a dermatome using Halothane (1-1.5%) anesthesia in a 3:5 mixture of 
oxygen/nitrous oxide. 

Preparation of cartridges with coated DNA-gold beads for Accell (Agracetus/Geniva, Inc.) 
25 particle-mediated gene transfer and the utilization of the Accell helium gene gun were 
according to the protocol provided by Geniva, Inc. (8520 University Green, Middleton, WI 
53562). Each partial thickness wound was provided with 0.2 jig of hEGF expressing plasmid 
and 0.8 jig of pcDNA3 or 0.2 ng hEGF expressing plasmid and 0.8 pig of pcDNA3-tetR. The 
driving pressure used was 800 pounds per square inch (psi). Following DNA transfer, the 
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tf ansfected wounds were enclosed in sealed vinyl adhesive chambers containing 1 .2 ml of 
isotonic saline in the presence of 100 units/ml penicillin and 100 fig/ml streptomycin. Wound 
fluid was withdrawn from the chambers at 22 hours post-gene transfer and the transfected sites 
were enclosed in new chambers. Following the collection of wound fluid and application of 
new chambers at 46 hours after gene transfer, pigs were given 500 mg of tetracycline by 
intravenous injection. At 24 hours after the administration of tetracycline, wound fluid was 
collected and stored at -70 degrees C. Levels of EGF in wound fluid was determined by ELISA 
with anti-HEGF specific antibody. 

ELISA: 

Expression ofhEGF in extracellular medium and wound fluid was determined on microtiter 
plates (96 wells) with the use of anti-hEGF specific monoclonal antibody (MAB236, R&D 
systems) as the primary coating antibody at 75 ng per well and anti-hEGF specific polyclonal 
antibody (sc275, Santa Cruz) as secondary antibody at 100 ng per well. The HRP-conjugated 
goat anti-rabbit polyclonal antibody (sc-2004, Santa Cruz) was used as tertiary antibody at 3.33 
ng per well. The peroxidase assay was performed according to the procedures of the TMB 
peroxidase EIA substrate kit (BIO-RAD) and analyzed on a Bmax Kinetic Microplate Reader 
(Molecular Devices Corporation, Sunnyvale, CA). The concentration ofhEGF in samples was 
fit to a SOFTmax 4-parameter standard curve generated with the use of recombinant hEGF 
(234-EG, R&D systems) in a two-fold dilution ranging from a concentration of a 2 pg to 200 
pg/ml in a volume of 200 \i\ per well. 

B. Results. 

In Vitro Regulation of the hCMV Major Immediate-Early Enhancer-Promoter by the 
Tetracycline Repressor: The hCMV major immediate-early enhancer-promoter represents one 
of the most potent cts-regulatory units for directing the expression of transgenes in mammalian 
cells. In addition to the TATA element, a variety of upstream cis-acting element have been 
identified (Figure I) and, by interacting with cellular and viral transactivators, these elements 
ensure highly efficient transcription directed by the TATA element. Transcription initiation 
requires basal transcription factors to interact with the TATA element in a coordinate fashion 
to form a transcription pre-initiation complex. The TATA-binding protein (TBP) is the first 
and only basal transcription factor to interact with DNA specifically and the binding of TBP to 
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the TATA element signals the transcription of the promoter. The present experiments were 
designed to test whether the tetracycline repressor can convert the hCMV major immediate- 
early enhancer-promoter into a regulatory switch by interacting with two tet operators about 10 
base pairs downstream of the hCMV TATA element in plasmid pWRG1630. The tet operator 
sequences were positioned so that the tet repressor would bind to the same side of the DNA 
helix as the TATA-binding protein. Based upon their close proximity, it was hypothesized that 
the binding of the tet repressor to the tet operator would either block the binding of the TBP to 
the TATA element or interfere with the assembly of the pre-initiation complex directly. 

Vero cells were transfected with 0.5 ng of pWRG1630 or pCMVtetOEGF either alone or 
in the presence of 1 |ig, 2 |xg, and 3 ^ig of the tet repressor expressing plasmid pcDNA3-tetR 
in medium either with no tetracycline or with tetracycline at a concentration of 1 ng per ml. 
Extracellular medium was collected from transfected cells every 20-24 hours, followed by the 
addition of fresh growth medium either with or without 1 \xg of tetracycline. The hEGF 
concentration in the collected extracellular medium was determined by ELISA. 

The results shown in Figure 2 demonstrate that the expression of human EGF from 
pWRG1630 was not affected by the presence of pcDNA3-tetR and that the insertion of the tet- 
operator containing sequence near the hCMV major immediate-early enhancer-promoter has 
no effect on human EGF expression in the absence of tetR. Expression of EGF from 
pCMVtetOEGF was significantly reduced in the presence of tetR in a dose and time dependent 
manner. In the presence of 3 ng of tetR repressor-expressing plasmid, i.e,. pcDNA3-tetR, EGF 
expression from pCMVtetOEGF was repressed approximately 200 fold at 20 hours post- 
transfection, 1000 fold at 20-24 hours post-transfection, and 3500 at 44-68 hours post- 
transfection in the absence of tetracycline. Litde or no repression was observed in the presence 
of tetracycline. In the presence of 2 \ig of pcDNA3-tetR, approximately a 100-fold, 600-fold, 
and 2000-fold inhibition of expression was detected at 0-20 hours, 20-44 hours and 44-48 hours 
post-transfection. In the presence of 1 ng of pcDNA3-tetR, approximately a 60-fold, 100-fold, 
and 200-fold reduction in the synthesis of human EGF was observed at the three indicated time 
points. There was no human EGF expression in mock transfected Vero cells. 

To test if tetR-mediated repression can be efficiently reversed by tetracycline, Vero cells 
were transfected with either pCMVtetOEGF alone or pCMVtetOEGF and pcDNA3-tetR in the 
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absence of tetracycline from 0-20 hours (Figure 3A) and in the presence or absence of 
tetracycline from 20-44 hours (Figure 3B). The results demonstrate that the repression observed 
from 0-20 hours post-transfection can be efficiently reversed by the presence of 1 fig/ml of 
tetracycline while 0. 1 fig/ml of tetracycline is not sufficient to reverse tetR-mediated repression 
under the conditions tested. Consistent with the experiments presented in Figure 2, the data 
demonstrated that the basal promoter activity of pCMVtetOEGF was reduced 100-200 fold 
during 0-20 hours post-transfection, and about 500 fold from 20-44 hours post-transfection in 
the presence of 2 jig of pcDNA3-tetR. 

Having demonstrated the kinetics of tetR-mediated regulation of the hCMV major 
immediate-early enhancer-promoter with a secretable peptide, human EGF, the ability of this 
system to regulate the expression of a non-secretable polypeptide, firefly luciferase was tested. 
Figure 4 shows the results of two independent experiments in which Vero cells were either 
transfected with 0.5 \ig of pCMVtetOGL2 alone, or co-transfected with 0.5 \xg of 
pCMVtetOGL2 and 2 jig of pcDNA3-tetR in the presence or absence of I ng/ml of tetracycline. 
The levels of luciferase expression from pCMVtetOGL2 were decreased at least 100-fold in the 
presence of the tetR-expressing plasmid, pcDNA3-tetR, and it was found that this repression 
could be released efficiently by tetracycline. The level of luciferase expression from the wild- 
type hCMV immediate-early enhancer-promoter are not affected by the presence of pcDNA3- 
tetR. When similar experiments were performed on HeLa cells, a 40-50-fold repression was 
detected at 68-72 hours post-transfection. This indicates that, like the tetR-VP16 based 
activating system, the efficiency of the tetR-based repression is cell type dependent. 

In Vivo Regulation of the hCMV Major Immediate-Early Enhancer-Promoter by the 
Tetracycline Repressor,: The data presented above demonstrate that: (I) the tet repressor is 
capable of acting as a potent sequence-specific trans-repressor in cultured mammalian cells; and 
(2) that the insertion of two tandem operators about 10 base pairs downstream of the promoters 
TATA element, converts the promoter into an effective tetracycline-dependent transcriptional 
switch. To test if this tetR-tet operator regulatory unit is functional in vivo, partial thickness 
wounds were created on porcine dorsal skin, with nine wounds receiving 0.2 fig of 
pCMVtetOEGF and 0.8 fig of pcDNA3 vector DNA and nine wounds receiving 0.2 jig of 
pCMVtetOEGF and 0.8 fig of pcDNA3-tetR per wound. As shown in Figure 5, human EGF 
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expression in partial thickness wounds co-transfected with pcDNA3-tetR was significantly 
lower than that observed in partial thickness wounds co-transfected with pcDNA3 vector 
plasmid in the absence of tetracycline. A 13 -fold repression was detected one day after gene 
transfer. 

Notably, although human EGF expression in pCMVtetOEGF transfected wounds was 
increased approximately 3 -fold from day one to day two post-gene transfer, yields of human 
EGF in partial thickness wounds co-transfected with pcDNA3-tetR were reduced 1 .5-fold. 
Collectively, in the presence of tetR, levels of human EGF expression were repressed 
approximately 55-fold at day two post-gene transfer in the absence of tetracycline. It is of 
particular significance that, upon receiving tetracycline through intravenous injection from day 
2 to day 3 post-gene transfer, the tetR-mediated repression was released as evidenced by a 4- 
fold increase of human EGF expression in wounds receiving both pCMVtetOEGF and 
pcDNA3-tetR. In wounds transfected with pcMVtetOEGF alone, there was about a 4-fold 
reduction in EGF expression from day 2 to day 3 post-gene transfer. This observation proves 
the feasibility of using this regulatory switch in controlling the expression of transgenes in gene 
therapy. 

C. Discussion 

Regulation of transgene expression in target cells represents one of the most critical and 
challenging aspects of gene therapy. Using the hCMV major immediate-early enhancer- 
promoter as a prototype mammalian cell promoter, it has been demonstrated that, placing 
tetracycline operators 10 base pairs of the TATA element, enables the tetracycline repressor to 
function as a potent repressor of gene expression in mammalian ceils. 

Recently, by fusing the KRAB repressor domain of the human KOX1 zinc-finger protein 
with the tet repressor and inserting DNA sequences encoding seven tet operators 685 base pairs 
upstream of the transcription initiation site, it has been shown that the tet-KRAB chimeric 
protein, but not tetR alone, can suppress the hCMV major immediate-early enhancer-promoter 
approximately 10-15-fold in HeLa cells in a transient expression assay using luciferase as a 
reporter (Deuschle, etal., Mol Cell Biol 75:1907-1914 (1995)) . Using a different strategy, 
in which tet operators were inserted 10 base pairs, a full helix turn, downstream of the TATA 
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element, it has been shown that the hCMV major immediate-early enhancer-promoter can be 
tightly regulated by tetR alone. Based on the study of Heuer & Hillen (J. Mol. Biol 202:407- 
415 (1988)) it was hypothesized that this specific design would place the tet repressor on the 
same side of the DNA helix as TBP and the binding of tetR to the tet operator provides a direct 
steric block for TBP. Using hEGF as a secretable promoter, the kinetics of tetR-mediated 
repression was explored. Close to a 4000-fold repression was observed in vitro at 3 days post- 
transfection. Combining a porcine wound model with particle-mediated gene transfer, this study 
has provided a direct in vivo confirmation of this tetR-mediated regulatory switch in fine tuning 
the expression of transgenes for gene therapy. 

Unlike other tet repressor/operator regulatory systems, e.g., the tetR-VP16 based activating 
and tetR-KRAB repressor system, the regulatory switch disclosed herein does not require the 
use of tetracycline repressor/mammalian cell transactivator or repressor fusion proteins to 
achieve its effects. Thus, the potential pleiotropic effects on the expression of cellular genes 
caused by cellular transcription factors are minimal and higher levels of expression of the 
regulator (i.e., the tetracycline repressor) can be achieved. Notably, the efficacy of the tetR- 
mediated regulatory switch was found to vary significantly in vitro as compared to in vivo. This 
apparent difference can probably be explained by: 1) differences in the means of gene transfer 
which may lead to different co-transfection efficiency; and 2) differences in cell types. 

Example 2: Viral Replication Switch 

To test whether the tetR-regulated transcription switch discussed above can be converted 
into a novel viral replication switch to regulate de novo viral production in a reversible fashion 
and produce a transdestructive recombinant virus, the following experiments were performed 
using herpes simplex virus type 1 as a prototype. 

A, Construction of Trans-dominant Negative HSV-1 UL9 Mutant Polypeptide 
Expressing Plasmids 

The UL9 protein is one of the seven HSV-1 essential gene products that are directly 
involved in viral replication. UL9 binds specifically to the HSV-1 origin of DNA replication. 
It is a nuclear phosphoprotein 851 amino acids in length. Studies have shown that the C- 
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terminal amino acids 535-851 of UL9 contain the DNA binding domain of the protein and, 
when over-expressed, it can block vims DNA replication in a dominant negative fashion. 

In order to clone, the C-terminal 3 1 7 amino acids of UL9 and place it under the control of 
the tet operator-containing hCMV major immediate-early enhancer-promotor, the Bam HI - Not 
5 I EGF-containing fragment in plasmid pCMVtetOEGF was replaced by the Bam HI - EcoR V 
UL9-containing fragment from plasmid pSP6UL9. The resulting plasmid was designated 
pCMVtetOUL9-C571 and expresses the C-terminal amino acids 571-851 of UL9. 

To construct plasmid pCMVtetOUL9-nlO/C535, a plasmid expressing a UL9 protein 
fragment containing amino acids 1 to 10 of UL9 and amino acids 535 to 851 of UL9, a double 
10 stranded oligo encoding the first 10 amino acids of the UL9 protein followed by amino acids 
Thr-Met-Gly was inserted into the Bam HI site of pCMVtetOUL9-C571. Plasmid 
pCMVtetOUL9-C535C, which expresses the C-terminal amino acids 535 to 851 of UL9, was 
constructed by the religation of Bam HI- Kpn I digested pCMVtetOUL9-nlO/C535. 

B. Transient Inhibition Analysis of HSV-1 Replication 

15 To test if the mutant UL9 polypeptides encoded by pCMVtetOUL9-C571 and 

pCMVtetOUL9-nl0/C535 can function as trans-dominant negative mutant polypeptides 
inhibiting HSV-1 replication, and, most importantly, to test whether an inhibitory effect can be 
regulated by the tet repressor, Vero cells were seeded at 5 x 10 5 cells per 60 nm. At 20 to 24 
hours after seeding, the cells were transfected with 0. 1 micrograms of purified infectious HS V- 

20 1 DNA alone or co-transfected with 0.1 micrograms of pCMVtetOUL9-C571 or 
pCMVtetOUL9-nl0/C535 in the presence of 1.5 micrograms of pcDNA3 vector DNA or the 
tet repressor- expressing plasmid, pcDNA3-tetR, by lipofectin. At 14 hours post transfection, 
the lipofectin-DNA containing transfection medium was removed followed by addition of 
methylcellulose to the transfected cells at 10 ml per dish. Viral plaques were visualized by 

25 staining transfected dishes with neutral red at 68 to 72 hours post transfection and counting 14 
hours later. As shown in Figure 1, co-transfection of infectious HSV-l DNA with 
pCMVtetOUL9-C571 reduces the viral placque forming efficiency approximately 30 fold. 
When co-transfected with pCMVtetOUL9-nlO/C535C, the placque forming efficiency of 
infectious HSV-l DNA was reduced at least 100 fold. Significantly, both C571- and 
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nl0/C535C- mediated repression of HSV-1 DNA replication can be efficiently silenced My the 
tet repressor. When a similar experiment was performed with pCMVtetOUL9-C535C, the 
placque formation of HSV-1 DNA was reduced at least 200 fold and again, this C535C- 
mediated repression can be efficiently reversed by tetR. 

Having demonstrated that the inhibitory effects of the trans-dominant negative C-terminal 
UL9 polypeptides on HSV-1 replication can be efficiently silenced by the tet repressor, the 
specificity of this tetR related viral replication switch was further investigated. Vero cells were 
transfected with: 1)0.2 micrograms of infectious HS V- 1 DNA and 2. 1 micrograms of pcDN A3 ; 
2) 0.2 micrograms of infectious HSV-1 DNA, 0.1 micrograms of pCMVtetOUL9-57l and 2 
micrograms of pcDNA3; and 3) 0.2 micrograms of infectious HSV-1 DNA, 0.1 micrograms 
of pCMVtetOUL9-C571 and 2 micrograms of pcDNAJ-tetR. Transfections were carried out 
either in the absence or the presence of tetracycline at 1 microgram per ml. At 16 hours post 
transfection, the transfection medium was removed and 5 ml of fresh medium was added to 
each dish with either no tetracycline or tetracycline at a concentration of 5 micrograms per ml. 
At 48 hours post transfection, cells were harvested and virus yields were determined. The data 
presented in Figure 2 demonstrate that: 1) C571 -mediated expression of HSV-l replication can 
be reversed by the tet repressor; and 2) this tetR-regulated reversion of HSV-1 replication is 
tetracycline specific as evidenced by the effect of pCMVtetOUL9-571 on HSV-1 placque 
forming units was significantly reduced in the presence of tetracycline. 

Collectively, these observations demonstrate that, by combining transdominant negative 
mutant viral polypeptides with the tetR-regulated potent mammalian transcription switch, a 
novel viral replication switch can be generated. In principle, any polypeptide or antisense RN A 
that is capable of inhibiting viral productive infection can be incorporated into this novel viral 
replication switch. Using this switch, a trans-destructive or inhibitory viral vector can be 
generated while tetR is not present in the viral genome. This trans-inhibitory viral vector is not 
only capable of serving as a vehicle for in vivo gene transfer, but is also capable of inhibiting 
the endogenous and/or latent virus replication. This invention can also be used for generating 
a viral vaccine which not only is capable of inducing an effective host immune response, but 
which is also able to function as a therapeutic agent helping to eliminate endogenous viral 
infection when encountered within the same cell. 
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All references cited herein are folly incorporated by reference. Having now folly described 
the invention, it will be understood by those of skill in the art that the invention may be 
practiced and wide and equivalent range of conditions, parameters and the like, without 
affecting the spirit or scope of the invention or any embodiments thereof. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Brigham and Women's Hospital 

(ii) TITLE OF INVENTION: Tetracycline Repressor Regulated Mammalian Cell Transcription 

Switch and Viral Replication Switch 

(iii) NUMBER OF SEQUENCES: 1 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Vinson & Elkins L.L.P. 

(B) STREET: 1455 Pennsylvania Avenue, N.W. 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: U.S. 

(F) ZIP: 20004-1008 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Sanzo, Michael A. 

(B) REGISTRATION NUMBER: 36,912 

(C) REFERENCE/DOCKET NUMBER: BRI33 1/87001 

(ix) TELECOMMUNICATION INFORMATION: 
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(A) TELEPHONE: (202) 639-6585 

(B) TELEFAX: (202) 639-6604 

(2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 : 
CTCCCTATCA GTGATAGAGA TCTCCCTATC AGTGATAGAG ATCGTCGACG AGCT 
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What is Claimed is: 

1 . A recombinant DNA molecule comprising: 

a) a mammalian promoter sequence having a TATA element; 

b) at least one tet operator sequence positioned at least 6 nucleotides 3' to the TATA 
element; and 

c) a gene lying 3' to said operator and operably linked to said promoter. 

2. The recombinant DNA of claim 1, wherein said tetracycline operator sequence is 
positioned between 6 and 24 nucleotides 3' to said TATA element. 

3. The DNA molecule of claim 1, wherein said promoter is the human cytomegalovirus 
(hCMV) immediate-early promoter. 

4. The DNA molecule of claim 1, further comprising a gene encoding the tet repressor 
protein. 

5 . A host cell transformed with a vector comprising the DNA molecule of claim I . 

6. Recombinant protein made by the host cell of claim 5. 

7. A method for recombinantly producing protein in a mammalian cell that makes the tet 
repressor protein, said method comprising: 

a) transforming said mammalian cell with a vector comprising: 

i) a mammalian promoter sequence having a TATA element; 

ii) at least one tet operator sequence positioned at least 6 nucleotides 3' to the 
TATA element; and 

iii) a gene lying 3' to said tet operator and operably linked to said promoter; 

b) introducing tetracycline into the transformed cells of step a) to induce the expression 
of said gene. 
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8. The method of claim 7, wherein said tet operator sequence is positioned between 6 and 24 
nucleotides 3' to said TATA element. 

9. The method of claim 7, wherein said promoter is the human CMV immediate-early 
promoter. 

10. The method of claim 7, wherein said mammalian cell is an embryonic stem cell and, prior 
to the introduction of tetracycline to induce gene expression, the method further comprises: 

(i) incorporating said stem cell into a blastocyst to form a chimeric embryo; 

(ii) implanting said chimeric embryo into a pseudopregnant animal; 

(iii) allowing said chimeric embryo to develop into a viable offspring; 

(iv) screening ofispring to identify heterozygous animals expressing said gene; and 

(v) breeding said heterozygous animals to produce homozygous transgenic animals 
producing said protein. 

1 1 . A transgenic animal made by the method of claim 10. 

1 2. A transgenic animal wherein said animal has integrated into its genome recombinant DNA 
comprising: 

a) a mammalian promoter sequence having a TATA element; 

b) at least one tet operator sequence positioned at least 6 nucleotides 3' to the TATA 
element; and 

c) a gene lying 3' to said operator and operably linked to said promoter. 

13. The recombinant DNA molecule of claim 12, wherein said promoter is the human 
cytomegalovirus (hCMV) immediate-early promoter. 

14. The recombinant DNA of claim 12, further comprising a gene encoding the tet repressor 
protein. 

15. Recombinant protein made by the transgenic animal of claim 12. 
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16. A recombinantly engineered virus comprising within its genome: 

a) a recombinant promoter having a TATA element; 

b) at least one tet operator sequence positioned at least 6 nucleotides 3' to the TATA 
element; and 

c) a gene lying 3' to said operator and operably linked to said promoter, wherein said 
gene inhibits the replication of said virus when expressed. 

17. The virus of claim 16, further comprising one or more mutations in at least one essential 
viral gene. 

18. The virus of claim 16, wherein said tet operator element is positioned between 6 and 24 
nucleotides 3' to said TATA element. 

19. The virus of claim 16, wherein said promoter is the human CMV immediate-early 
promoter. 

20. The virus of claim 16, further comprising: 

a) a second recombinant promoter located within the viral genome; and 

b) a second recombinant gene operably linked to said second recombinant promoter. 

2 1 . The virus of claim 20, further comprising one or more mutations in an essential viral gene. 

22. The virus of claim 20, further comprising at least one tet operator sequence lying at least 
6 nucleotides 3' to a TATA element in said second recombinant promoter and 5' to said 
second recombinant gene. 

23. A host cell made by transfecting a cell with the virus of claim 16. 

24. Recombinant protein made by the host cell of claim 23 

25. A vaccine comprising the virus of claim 16. 
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26. A method for treating a patient for an infection by a first virus, comprising: 

a) transforming a second virus by incorporating into its genome DNA comprising: 

i) a mammalian promoter having a TATA element; 

ii) at least one tet operator sequence positioned at least 6 nucleotides 3' to the 
TATA element; and 

iii) a gene positioned 3' to said operator and operably linked to said promoter, 
wherein said gene, when expressed, is capable of blocking the expression of 
both said first virus and said second virus; 

b) growing the transformed second virus of step a) in a host expressing the tet repressor 
protein; 

c) collecting and purifying the virus grown in step b); and 

d) administering the virus collected and purified in step c) to said patient. 

27. The method of claim 26, wherein said operator is between 6 and 24 nucleotides 3' to said 
TATA element. 

28. The method of claim 26, wherein said promoter is the human CMV immediate-early 
promoter. 

29. The method of claim 26, wherein step a) further comprises: 

iv) introducing one or more mutations in an essential viral gene. 

30. A method for delivering a nucleic acid therapeutic agent to cells, comprising: 

a) preparing a virus to serve as a vector, wherein said virus is engineered to contain 
within its genome: 

i) a recombinant mammalian promoter having a TATA element; 

ii) at least one tet operator sequence positioned at least 6 nucleotides 3' to the 
TATA element; and 

iii) a gene positioned 3' to said operator and operably linked to said promoter, 
wherein said gene encodes a protein capable of inhibiting the replication of said 
virus; 

iv) said nucleic acid therapeutic agent, operably linked to a second promoter; 
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b) growing the virus prepared in step a) in host cells expressing the tet repressor protein; 

c) collecting and purifying the virus grown in step b); and 

d) administering the virus collected and purified in step c) to said patient. 

31. The method of claim 30, wherein said virus further comprises at least one tet operator 
sequence lying at least 6 nucleotides 3' to a TATA element in said second recombinant 
promoter and 5' to said second recombinant gene. 

32. The method of claim 30, wherein said tet operator sequence is positioned between 6 and 
24 nucleotides 3' to said TATA element. 

33. The method of claim 30, wherein said recombinant mammalian promoter is the human 
CMV immediate-early promoter. ' 

34. The method of claim 30, wherein said nucleic acid therapeutic agent acts as an antisense 
inhibitor of gene expression. 

35. The method of claim 30, wherein said nucleic acid therapeutic agent encodes a protein with 
a therapeutic action. 

36. The method of claim 30, wherein step a) further comprises: 

iv) introducing one or more mutations in an essential viral gene. 
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FIG.2B 
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Functional anatomy of siRNAs for mediating 
efficient RNAi in Drosophila melanogaster 
embryo lysate 



Sayda M. Elbashir, Javier Martinez, 

Agnieszka Patkaniowska, 

Winfried Lendeckel and Thomas Tuschl 1 

Department of Cellular Biochemistry, Max-Planck-Insutute for 
Biophysical Chemistry, Am Fassberg 11, D-37077 Gottiagen. 
Germany 

'Corresponding author 

e-mail: ttuschl@mpibpc.gwdg.de 

S.M .Elbashir and J.Martinez contributed equally to this work 

Duplexes of 21-23 nucleotide (nt) RNAs are the 
sequence-specific mediators of RNA interference 
(RNAi) and post-transcriptional gene silencing 
(PTGS). Synthetic, short interfering RNAs (siRNAs) 
were examined in Drosophila melanogaster embryo 
lysate for their requirements regarding length, struc- 
ture, chemical composition and sequence in order to 
mediate efficient RNAi. Duplexes of 21 nt siRNAs with 
2 nt 3' overhangs were the most efficient triggers of 
sequence-specific mRNA degradation. Substitution of 
one or both siRNA strands by 2'-deoxy or 2 , -0-methyl 
oligonucleotides abolished RNAi, although multiple 
2'-deoxynucleotide substitutions at the 3' end of 
siRNAs were tolerated. The target recognition process 
is highly sequence specific, but not all positions of a 
siRNA contribute equally to target recognition; 
mismatches in the centre of the siRNA duplex prevent 
target RNA cleavage. The position of the cleavage site 
in the target RNA is defined by the 5' end of the guide 
siRNA rather than its 3' end. These results provide a 
rational basis for the design of siRNAs in future gene 
targeting experiments. 

Keywords: PTGS/RNA interference/small interfering 
RNA 



Introduction 

Post-transcriptional gene silencing (PTGS) mediated by 
double-stranded (ds) RNA represents an evolutionarily 
conserved cellular defence mechanism for controlling the 
expression of alien genes in protists, filamentous fungi, 
plants and animals (Fire, 1999; Bass, 2000; Cogoni and 
Macino, 2000; Carthew, 2001; Hammond et aL, 2001b; 
Sharp, 2001; Tuschl, 2001; Voinnet, 2001; Waterhouse 
et aL, 2001). It is believed that random integration of alien 
genes (such as transposons) or viral infection causes 
production of dsRNA, which activates sequence-specific 
degradation of homologous single-stranded mRNA or 
viral genomic RNA, thereby preventing expression or 
replication of the foreign genetic material. The dsRNA is 
used as the guide RNA in this sequence-specific RNA 
degradation process. In some cases, dsRNA may also be 



involved in amplification of the silencing signal important 
for systemic spread (Palauqui et aL, 1997; Voinnet et aL, 
1998) or long-term maintenance of silencing (Dalmay 
et aL, 2000; Mourrain et aL, 2000; Sraardon et al. t 2000). 
In animals, the dsRNA-triggered silencing effect is 
referred to as RNA interference (RNAi; Fire et al. f 1998). 

One important feature of the mechanism of RNAi 
is the processing of long dsRNAs into duplexes of 
21-25 nucleotide (nt) RNAs. These short RNA products 
were first detected in plant tissues that exhibited transgene- 
or vims-induced PTGS (Hamilton and Baulcombe, 1999), 
but were also found later in fly embryos and worms 
injected with long dsRNAs (Panish et aL, 2000; Yang 
et aL, 2000) or in extracts from Drosophila melanogaster 
Schneider-2 (S2) cells that were transfected with dsRNA 
(Hammond et aL, 2000). The processing reaction of long 
dsRNAs to 21-23 nt RNAs was first recapitulated in vitro, 
in extracts prepared from D. melanogaster embryos 
(Zamore et aL, 2000) and later in extracts from S2 cells 
(Bernstein et aL, 2001). In the embryo lysate, it was 
observed that the target mRNA was cleaved in -21 nt 
intervals (Zamore et aL, 2000) and that synthetic 21 and 
22 nt RNA duplexes added to the lysate were able to guide 
efficient sequence-specific mRNA degradation, while 
duplexes of 30 bp dsRNA were inactive (Elbashir et aL, 
2001b). The 21 nt RNA products were therefore named 
small interfering RNAs or silencing RNAs (siRNAs). 

A ribonuclease III enzyme, dicer, is required for 
processing of long dsRNA into siRNA duplexes 
(Bernstein et aL, 2001). It was recently shown that dicer 
has an additional cellular function and is also required for 
excision of 21 and 22 nt small temporal RNAs (stRNAs) 
from -70 nt stable stem-loop precursors (Grishok et aL, 
2001; Hutvagner et aL, 2001). These tiny expressed RNA 
molecules are important regulators of developmental 
timing and control the translation of downstream regula- 
tory genes (Ambros, 2000; Moss, 2000; Pasquinelli et aL, 
2000). stRNAs are different from siRNAs in that the target 
mRNA is not degraded during silencing (Wightman et aL, 
1993; Olsen and Ambros, 1999) and they are single 
stranded (Reinhart et aL, 2000), while siRNAs are 
believed to be double stranded (Elbashir et aL, 2001b; 
Hutvagner et aL, 2001). 

In RNAi, a siRNA-containing endonuclease complex 
cleaves a single-stranded target RNA in the middle of the 
region complementary to the 21 nt guide siRNA of the 
siRNA duplex (Elbashir et aL, 2001b). This cleavage site 
is one helical turn displaced from the cleavage site that 
produced the siRNA from long dsRNA, suggesting 
dramatic conformational and/or compositional changes 
after processing of long dsRNA to 21 nt siRNA duplexes. 
The target RNA cleavage products are rapidly degraded 
because they either lack the stabilizing cap or poly(A) tail. 
A protein component of the -500 kDa endonuclease or 
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*RNA-induced silencing complex (RISC) was recently 
identified and is a member of the argonaute family of 
proteins (Hammond et a/., 2001a); however, it is currently 
unclear whether dicer is required for RISC activity. 

It is also unknown whether RISC contains single- or 
double-stranded siRNAs. By analogy to stRNA excision, it 
may be envisaged that only one of the strands of a siRNA 
duplex is incorporated into a catalytic siRNP, but because 
of the symmetry of the siRNA duplex, two approximately 
equal populations of sense and antisense strand -containing 
catalytic siRNPs are produced. Synthetic siRNA duplexes 
cleaved sense as well as antisense target RNAs in the 
middle of the region covered by the siRNA duplex in 
D.melanogaster lysate (Elbashir et a/., 2001b). However, 
longer dsRNAs did not produce symmetric sense and 
antisense target RNA cleavage sites in embryo lysate 
(Elbashir et aL y 2001b), suggesting that the direction of 
processing of long dsRNA defined which of the strands of 
the resulting siRNA duplex could be used for guiding 
target degradation. Some protein, involved in the produc- 
tion of the 21 nt siRNA duplexes, may be deposited on the 
siRNA duplex to mark the strand that is going to be used 
for guiding target RNA cleavage. 

Despite the lack of profound mechanistic understand- 
ing, RNAi has rapidly developed into an important tool for 
reverse genetics and has been widely applied in 
Caenorhabditis elegans (Fraser et al. y 2000; Gonczy 
et al % 2000; Piano et aL, 2000; Maeda et al. t 2001), as 
well as in insects (see references in Lam and Thummel, 
2000) and insect cell lines (Clemens et ai, 2000; 
Hammond et al. y 2000; Ui-Tei et aL, 2000). RNAi has 
also been shown to occur in a variety of vertebrates by 
targeting of mRNAs important for embryonic develop- 
ment. In differentiated mammalian cells, dsRNAs with 
>30 bp generally activate the interferon response, which 
leads to a global shut-off in protein synthesis as well as 
non-specific mRNA degradation (Stark et al., 1998). This 
unspecific response to long dsRNAs can be bypassed using 
21 nt siRNA duplexes, resulting in specific knock-down of 
the expression of the targeted gene (Elbashir et al. % 2001a; 
Hutvagner et al. t 2001), providing a new method for 
analysis of mammalian gene function in cultured cells. 

Here we describe the results of a systematic analysis of 
the length, secondary structure, sugar backbone and 
sequence specificity of siRNA duplexes for RNAi, using 
the established D.melanogaster embryo in vitro system. 
The most potent siRNA duplexes are 21 nt long, 
comprising a 19 nt base-paired sequence with 2 nt 3'- 
overhanging ends. The 5' end of the target-complementary 
siRNA strand (guide siRNA) sets the ruler to define the 
position of target RNA cleavage. Furthermore, we find that 
target recognition is extremely specific, as even single 



nucleotide mismatches between the siRNA duplex and the 
target mRNA abolish interference. These results provide a 
rational basis for the design of siRNAs for future gene 
targeting experiments. 



Results 

Variation of the 3' overhang in duplexes of 
21 nt siRNAs 

We reported previously that two or three unpaired 
nucleotides at the 3' end of siRNA duplexes were more 
efficient in target RNA degradation than blunt-ended 
duplexes (Elbashir et al, 2001b). To perform a more 
comprehensive analysis of the function of the terminal 
nucleotides, we synthesized five 21 nt sense siRNAs, each 
displaced by one nucleotide relative to the target RNA, and 
eight 21 nt antisense siRNAs, each displaced by one 
nucleotide relative to the target (Figure 1 A). By combining 
these sense and antisense siRNAs, a series of eight siRNA 
duplexes with symmetric overhanging ends were gener- 
ated spanning a range from 7 nt 3' overhang to 4 nt 5' 
overhang. The interference was measured using the dual 
luciferase assay system (Tuschl et a/., 1999; Zamore et a/., 
2000). siRNA duplexes were directed against firefly 
luciferase mRNA and sea pansy luciferase mRNA was 
used as internal control. The luminescence ratio of target 
to control luciferase activity was determined in the 
presence of siRNA duplex and was normalized to that 
observed in its absence. For comparison, the interference 
ratios of long dsRNAs (39-504 bp) are shown in Figure IB 
(Elbashir et a/., 2001b). The interference ratios were 
determined at concentrations of 5 nM for long dsRNAs 
(Figure 1A) and at 100 nM for siRNA duplexes 
(Figure IC-J). The 100 nM concentration of siRNAs 
was chosen because complete processing of 5 nM 504 bp 
dsRNA would result in 120 nM total siRNA duplexes. 

The ability of 2 1 nt siRNA duplexes to mediate RNAi is 
dependent on the number of overhanging nucleotides or 
base pairs formed. Duplexes with 4-6 3 / -overhanging 
nucleotides were unable to mediate RNAi (Figure IC-F), 
as were duplexes with two or more 5 '-overhanging 
nucleotides (Figure 1G-J). The duplexes with 2 nt 3' 
overhangs were most efficient in mediating RNA inter- 
ference, although the efficiency of silencing was also 
sequence dependent and up to 12-fold differences were 
observed for different siRNA duplexes with 2 nt 3' 
overhangs (compare Figure ID-H). Duplexes with blunted 
ends, 1 nt 5' overhang or 1-3 nt 3' overhangs were 
sometimes functional and sometimes completely inactive. 
The small silencing effect observed for the siRNA duplex 
with 7 nt 3' overhang (Figure 1C) may be due to an 
antisense effect of the long 3' overhang rather than to 



Fig. I. Variation of the 3' overfiang of duplexes of 21 nt siRNAs. (A) Outline of the experimental strategy. The capped and polyadenylated sense 
target mRNA is depicted and the relative positions of sense and antisense siRNAs are shown. Eight series of duplexes according to the eight different 
antisense strands were prepared. The siRNA sequences and the number of overhanging nucleotides were changed in I nt steps. (B) Normalized 
relative luminescence of target luciferase (Photinus pyralis, Pp-luc) to control luciferase {Renitta reniformis, Rr-luc) in D.melanogaster embryo lysate 
in the presence of 5 nM blunt-ended dsRNAs (Elbashir et ai, 2001b). The luminescence ratios determined in the presence of dsRNA were normalized 
to the ratio obtained for a buffer control (bu; black bar). Normalized ratios less than I indicate specific interference. (C-J) Normalized interference 
ratios for eight series of 21 nt siRNA duplexes. The sequences of siRNA duplexes are depicted above the bar graphs. Each part shows the interference 
ratio for a set of duplexes formed with a given antisense guide siRNA and five different sense siRNAs. The number of overhanging nucleotides 
(3' overhang, positive numbers; 5' overhang, negative numbers) is indicated on the x-axis. Data points were averaged from at least three independent 
experiments. Error bars represent standard deviations. 
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Fig. 2. VariaUoQ of the length of the sense strand of siRNA duplexes. (A) Representation of the experiment Three 21 nt antisense strands were paired 
with eight sense siRNAs. The siRNAs were changed in length at their 3' end. The 3' overhang of the antisense siRNA was 1 nt (B). 2 nt (C) or 
3 nt (D), while the sense siRNA overhang was varied for each series. The sequences of the siRNA duplexes and the corresponding interference ratios 
are indicated. 



RNAi. Comparison of the efficiency of RNAi between 
long dsRNAs (Figure IB) and the most effective 21 nt 
siRNA duplexes (Figure IE, G and H) indicates that a 
single siRNA duplex at 100 nM concentration can be as 
effective as 5 nM 504 bp dsRNA. 

Length variation of the sense siRNA paired to an 
invariant 21 nt antisense siRNA 

In order to investigate the effect of the length of siRNAs 
on RNAi, we prepared three series of siRNA duplexes, 
combining three 2 1 nt antisense strands with eight 18-25 nt 
sense strands. The 3' overhang of the antisense siRNA was 
fixed to 1, 2 or 3 nt in each siRNA duplex series, while the 



sense siRNA was varied at its 3' end (Figure 2A). 
Independently of the length of the sense siRNA, we 
found that duplexes with 2 nt 3' overhang of antisense 
siRNA (Figure 2C) were more active than those with 1 or 
3 nt 3' overhang (Figure 2B and D). In the first series, with 
1 nt 3' overhang of antisense siRNA, duplexes with 21 and 
22 nt sense siRNAs, carrying a 1 and 2 nt 3' overhang of 
sense siRNA, respectively, were most active. Duplexes 
with 19-25 nt sense siRNAs were also able to mediate 
RNA, but to a lesser extent. Similarly, in the second series, 
with 2 nt overhang of antisense siRNA, the 21 nt siRNA 
duplex with 2 nt 3' overhang was most active and any other 
combination with the 18-25 nt sense siRNAs was active to 
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Fig. 3. Variation of the length of siRNA duplexes with preserved 2 nt 3' overhangs. (A) Graphic representation of the experiment. The 21 nt siRNA 
duplex is identical in sequence to the one shown in Figures IH and 2C. The siRNA duplexes were extended to the 3' side of the sense siRNA (B) or 
the 5' side of the sense siRNA (C). The siRNA duplex sequences and the respective interference ratios are indicated. 



a significant degree. In the last series, with 3 nt antisense 
siRNA 3' overhang, only the duplex with a 20 nt sense 
siRNA and 2 nt sense 3' overhang was able to reduce target 
RNA expression. Together, these results indicate that the 
length of the siRNA as well as the length of the 3' 
overhang are important, and that duplexes of 2 1 nt siRNAs 
with 2 nt 3' overhang are optimal for RNAi. 



Length variation of siRNA duplexes with a 
constant 2 nt 3' overhang 

We then examined the effect of simultaneously changing 
the length of both siRNA strands by maintaining symmet- 
rical 2 nt 3' overhangs (Figure 3A). Two series of siRNA 
duplexes were prepared, including the 21 nt siRNA duplex 
of Figure 1H as reference. The length of the duplexes was 
varied between 20 and 25 bp by extending the base-paired 
segment at the 3' end of the sense siRNA (Figure 3B) or at 
the 3' end of the antisense siRNA (Figure 3C). 



Duplexes of 20-23 bp caused specific repression of 
target luciferase activity, but the 21 nt siRNA duplex was 
at least 8-fold more efficient than any of the other 
duplexes. siRNA duplexes of 24 and 25 nt did not result in 
any detectable interference. Sequence-specific effects 
were minor as variations on both ends of the duplex 
produced similar effects. 

2'deoxy- and 2'-0-methyl-modified 
siRNA duplexes 

To assess the importance of the siRNA ribose residues for 
RNAi, duplexes with 21 nt siRNAs and 2 nt 3' overhangs 
with 2'-deoxy- or 2'-0-raethyl-modified strands were 
examined (Figure 4). Substitution of the 2 nt 3' overhangs 
by 2 , -deoxynucleotides had no effect and even the 
replacement of two additional ribonucleotides by 2'- 
deoxyribonucleotides adjacent to the overhangs in the 
paired region produced significantly active siRNAs. Thus, 
8 out of 42 nt of a siRNA duplex were replaced by DNA 
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Fig. 4. Substitution of the 2'-hydroxyl groups of the siRNA ribose 
residues. The 2'-hydroxyl groups (OH) in the strands of siRNA 
duplexes were replaced by 2'-deoxy (d) or 2'-0-methyl (Me). 2 and 4 at 
2'-deoxy substitutions at the 3' ends are indicated as 2- and 4-nt d, 
respectively. Uridine residues were replaced by 2'-deoxythymidine. 



A 

7m GpppO 



sense target 



1 

2 

3 
4 

5 



f 



7* 




UODi 



antisense target 



residues without loss of activity. Complete substitution of 
one or both siRNA strands by 2'-deoxy residues, however, 
abolished RNAi, as did complete substitution by 2'-<7- 
methyl residues. 



Definition of target RNA cleavage sites 

Target RNA cleavage positions were previously deter- 
mined for 22 nt siRNA duplexes and for a 21 and 22 nt 
duplex (Elbashir et al. y 2001b). The position of target RNA 
cleavage was located in the centre of the region covered by 
the siRNA duplex, 1 1 or 12 nt downstream of the first 
nucleotide that was complementary to the 21 or 22 nt 
siRNA guide sequence. Five distinct 21 nt siRNA 
duplexes with 2 nt 3' overhang (Figure 5 A) were incubated 
with 5' cap-labelled sense or antisense target RNA in 
D.melanogaster lysate (Tuschl et a/., 1999; Zamore et a/., 
2000). The 5' cleavage products were resolved on 
sequencing gels (Figure 5B). The amount of sense target 
RNA cleaved correlated with the efficiency of siRNA 
duplexes determined in the translation -based assay, and 
siRNA duplexes 1, 2 and 4 (Figures 5B, IE, G and H) 
cleaved target RNA faster than duplexes 3 and 5 
(Figures 5B, ID and F). Notably, the sum of radioactivity 
of the 5' cleavage product and the input target RNA were 
not constant over time and the 5' cleavage products did not 
accumulate. Presumably, the cleavage products, once 
released from the siRNA-endonuclease complex, were 
rapidly degraded due to the lack of either the poly(A) tail 
or the 5' cap. 

The cleavage sites for both sense and antisense target 
RNAs were located in the middle of the region spanned by 
the siRNA duplexes. The cleavage sites for each target 
produced by the five different duplexes varied by I nt 
according to the 1 nt displacement of the duplexes along 
the target sequences. The targets were cleaved precisely 
i I nt downstream of the target position complementary to 
the 3'-most nucleotide of the sequence-complementary 
guide siRNA (Figure 5). 

In order to determine whether the 5' or the 3' end of the 
guide siRNA sets the ruler for target RNA cleavage, we 
devised the experimental strategy oudined in Figure 6A 
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Fig. 5. Mapping of sense and antisense target RNA cleavage by 21 nt 
siRNA duplexes with 2 nt 3' overhangs. (A) Representation of 32 P 
(asterisk) cap-labelled sense and antisense target RNAs and siRNA 
duplexes. The position of sense and antisense target RNA cleavage is 
indicated by triangles on top and below the siRNA duplexes, 
respectively. (B) Mapping of target RNA cleavage sites. After 2 h 
incubation of 10 nM target RNA with 100 nM siRNA duplex in 
D.melanogaster embryo lysate. the 5' cap-labelled substrate and the 5' 
cleavage products were resolved on 6% sequencing gels. Length 
markers were generated by partial RNase Tl digestion (Tl) and partial 
alkaline hydrolysis (OH) of the target RNAs. The bold lines to the left 
of the images indicate the region covered by the siRNA strands 1 and 5 
of the same orientation as the target. 
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Fig. 6. The 5' end of a guide siRNA defines the position of target RNA cleavage. (A and B) Representation of the experimental strategy. The 
antisense siRNA was the same in all siRNA duplexes, but the sense strand was varied between 18 and 25 nt by changing the 3' end (A) or 18 and 
23 nt by changing the 5' end (B). The position of sense and antisense target RNA cleavage is indicated by triangles on top and below the siRNA 
duplexes, respecti vely. (C and D) Analysis of target RNA cleavage using cap-labelled sense (top) or antisense (bottom) target RNAs. The residual 
amount of targeted substrate and the cap-labelled 5' cleavage products are shown. The sequences of the siRNA duplexes ore indicated and the length 
of the sense siRNA strands is marked on top. The control lane, marked with a dash in (C), shows target RNA incubated in absence of siRNAs. 
Markers were as described in Figure 5. The arrows in (D), bottom, indicate the target RNA cleavage sites that differ by I nt. 



and B. A 2 1 nt antisense siRNA, which was kept invariant 
for this study, was paired with sense siRNAs that were 



modified in length at either of their 5' or 3' ends. The 
position of sense and antisense target RNA cleavage was 
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determined as described above. Changes in the 3' end of 
the sense siRNA, monitored for I nt 5' overhang to 6 nt 3' 
overhang, did not affect either the position of sense nor 
antisense target RNA cleavage (Figure 6C). Changes in the 
5' end of the sense siRNA did not affect the sense target 
RNA cleavage (Figure 6D, top), as expected, because the 
antisense siRNA was unchanged. The residual amount of 
uncleaved sense target RNA (Figure 6C and D top) 
correlated with the efficiency of siRNA duplexes deter- 
mined in translation-based assays (Figure 2C and data not 
shown), but did not correlate with the amount of detected 
cleavage product. Accumulation of cleavage products was 
more pronounced for the longer and less efficient siRNA 
duplexes, suggesting that product release may have 
become rate limiting. Because the antisense siRNA was 
kept unchanged while the sense siRNA was varied, an 
alteration in product release implies a role of the sense 
siRNA strand in the target RNA degradation process. 

Changes in the 5' end of the sense siRNA, in contrast to 
its 3' end, strongly affected antisense target RNA cleavage 
(Figure 6D, bottom). The antisense target was only 
cleaved when the sense siRNA was 20 or 21 nt in size. 
The position of cleavage differed by 1 nt, suggesting that 
the 5' end of the target-recognizing siRNA sets the ruler 
for target RNA cleavage. This position is located between 
nucleotide 10 and li when counting in an upstream 
direction from the target nucleotide paired to the 5'-most 
nucleotide of the guide siRNA (see also Figure 5A). 

Sequence effects and 2-deoxy substitutions in the 
3' overhang 

The 2 nt 3' overhang is critical for siRNA function. We 
wanted to know whether the sequence of the overhanging 
nucleotides contributes to target recognition or is only a 
feature required for reconstitution of the endonuclease 
complex (RISC or siRNP). We synthesized sense and 
antisense siRNAs with AA, CC, GG, UU and UG 3' 
overhangs and included the 2'-deoxy modifications TdG 
and TT (T, 2'-deoxy thymidine; dG, 2'-deoxyguanosine). 
The wild-type siRNAs contained AA in the sense 3' 
overhang and UG in the antisense 3' overhang (AA/UG). 
All siRNA duplexes were functional in the interference 
assay and reduced target expression at least 5-fold 
(Figure 7). The most efficient siRNA duplexes, which 
reduced target expression > 10-fold, were of the sequence 
type NN/UG, NN/UU, NN/TdG and NN/TT (N, any 
nucleotide). siRNA duplexes with an antisense siRNA 3' 
overhang of AA, CC or GG were less active by a factor of 
2-4 when compared with the wild-type sequence UG or 
the mutant UU. This reduction in RNAi efficiency is likely 
to be due to the contribution of the penultimate 3' 
nucleotide to sequence-specific target recognition, as the 
3'-terminal nucleotide was changed from G to U without 
effect. 

Changes in the sequence of the 3' overhang of the sense 
siRNA did not reveal any sequence-dependent effects, 
which was not surprising because the sense siRNA is not 
expected to contribute to the sequence-specific recognition 
of the sense target mRNA. 

Sequence specificity of target recognition 

In order to examine the sequence specificity of target 
recognition, we introduced sequence changes into the 
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Fig. 7. Sequence variation of the 3' overhang of siRNA duplexes. 
The 2 nt 3' overhang (NN, in grey) was changed in sequence and 
composition as indicated (T, 2'-deoxythymidine; dG, 2'-deoxy- 
guanosine; asterisk, wild-type siRNA duplex). Normalized interference 
ratios were determined as described in Figure 1 . The wild-type 
sequence is the same as depicted in Figure 4. 

paired segments of siRNA duplexes and determined the 
efficiency of silencing. Sequence changes were introduced 
by inverting short segments of 3 or 4 nt or inducing point 
mutations (Figure 8). The sequence changes in one siRNA 
strand were compensated for in the complementary siRNA 
strand to avoid perturbing the base-paired siRNA duplex 
structure. The sequence of all 2 nt 3' overhangs was TT to 
reduce costs of synthesis. The TT/TT reference siRNA 
duplex was comparable in RNAi to the wild-type siRNA 
duplex AA/UG (Figure 7). The ability to mediate reporter 
mRNA destruction was quantified using the translation- 
based luminescence assay. Duplexes of siRNAs with 
inverted sequence segments showed dramatically reduced 
ability for targeting the firefly luciferase reporter 
(Figure 8). The sequence changes located between the 3' 
end and the middle of the antisense siRNA completely 
abolished target RNA recognition, but mutations near the 
5' end of the antisense siRNA exhibited a small degree of 
silencing. Transversion of the AU base pair located 
directly opposite the predicted target RNA cleavage site 
or 1 nt further away from the predicted site prevented 
target RNA cleavage, therefore indicating that a single 
mutation within the centre of a siRNA duplex discrimin- 
ates between mismatched targets. 

Discussion 

siRNAs are valuable reagents for inactivation of gene 
expression, not only in insect cells but also in mammalian 
cells, with a great potential for therapeutic application 
(Elbashir et a/., 2001a). We have systematically analysed 
the structural determinants of siRNA duplexes required 
to promote efficient target RNA degradation in 
D.melanogaster embryo lysate, thus providing rules for 
the design of most potent siRNA duplexes. A perfect 
siRNA duplex is able to silence gene expression with an 
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Fig. 8. Sequence specificity of target recognition. The sequences of the 
mismatched siRNA duplexes are shown, modified sequence segments 
or single nucleotides are shaded in grey. The reference duplex (ref) and 
the siRNA duplexes 1-7 contain 2'-deoxythymidine 2 nt overhangs. 
The silencing efficiency of the thymidine- modi tied reference duplex 
was comparable to the wild-type sequence (Figure 7). Normalized 
interference ratios were determined as described in Figure 1. 



efficiency comparable to a 500 bp dsRNA, given that 
comparable quantities of total RNA are used. 

The siRNA user guide 

Efficiently silencing siRNA duplexes are composed of 
21 nt sense and 21 nt antisense siRNAs and must be 
selected to form a 19 bp double helix with 2 nt 3'- 
overhanging ends. 2'-deoxy substitutions of the 2 nt 3'- 
overhanging ribonucleotides do not affect RNAi, but help 
to reduce the costs of RNA synthesis and may enhance 
RNase resistance of siRNA duplexes. More extensive 2'- 
deoxy or 2'-0-methyl modifications reduce the ability of 
siRNAs to mediate RNAi, probably by interfering with 
protein association for siRNP assembly. 

Target recognition is a highly sequence-specific pro- 
cess, mediated by the siRNA complementary to the target. 
The 3'-most nucleotide of the guide siRNA does not 
contribute to the specificity of target recognition, while the 
penultimate nucleotide of the 3' overhang affects target 



RNA cleavage and a mismatch reduces RNAi 2- to 4-fold. 
The 5' end of the guide siRNA also appears more 
permissive for mismatched target RNA recognition when 
compared with the 3' end. Nucleotides in the centre of the 
siRNA, located opposite to the target RNA cleavage site, 
are important specificity determinants and even single 
nucleotide changes reduce RNAi to undetectable levels. 
This suggests that siRNA duplexes may be able to 
discriminate mutant or polymorphic alleles in gene 
targeting experiments, which may become an important 
feature for future therapeutic developments. 

Sense and antisense siRNAs, when associated with the 
protein components of the endonuc lease complex or its 
commitment complex, were suggested to play distinct 
roles; the relative orientation of the siRNA duplex in this 
complex defines which strand can be used for target 
recognition (Elbashir et a/., 2001b). Synthetic siRNA 
duplexes with an equal number of overhanging nucleotides 
have dyad symmetry with respect to the double-helical 
structure, but not with respect to sequence. The association 
of siRNA duplexes with the RNAi proteins in the 
D.melanogaster lysate leads to the formation of two 
asymmetric complexes. Ln such hypothetical complexes, 
the chiral environment is distinct for sense and antisense 
siRNA, hence their function. The prediction obviously 
does not apply to palindromic siRNA sequences or to 
RNAi proteins that could associate as homodimers. To 
minimize sequence effects that may affect the ratio of 
sense- and antisense- targeting siRNPs, we suggest using 
siRNA sequences with identical 3 '-overhanging se- 
quences. We recommend adjusting the sequence of the 
overhang of the sense siRNA to that of the antisense 3' 
overhang because the sense siRNA does not have a target 
in typical knock-down experiments. Asymmetry in tie 
reconstitution of sense- and antisense-cleaving siRNPs 
could be, partially, responsible for the variation in RNAi 
efficiency observed for various 21 nt siRNA duplexes 
with 2 nt 3' overhangs used in this study (Figure 1). 
Alternatively, the nucleotide sequence at the target site 
and/or the accessibility of the target RNA structure may be 
responsible for the variation in efficiency observed for 
these siRNA duplexes. It should be noted that all siRNAs 
used in this study are derived from a short region of one 
gene. Thus, it is more likely that differences in siRNA 
efficiency are a consequence of the primary sequences of 
the siRNAs and the respective target sites, rather than the 
secondary or tertiary structure of the targeted RNA. 

Natural siRNAs versus synthetic siRNAs 

In D.melanogaster, siRNA duplexes are produced in vitro 
and in vivo from long dsRNAs (Hammond et a/., 2000; 
Yang et aL, 2000; Zamore et a/., 2000). About 45% of 
these short RNAs are precisely 21 nt long, 28% are 22 nt 
long and a few percent are shorter or longer RNAs 
(Elbashir et aL 9 2001b). This length distribution correlates 
with our finding that 21 nt siRNA duplexes are the most 
efficient mediators of mRNA degradation. Beside the 
length, the paired structure and overhang are also import- 
ant. This structural feature may explain why siRNA 
duplexes isolated from the dsRNA processing reaction 
under denaturing conditions were less potent for RNAi 
than longer dsRNAs that were processed to siRNAs during 
the targeting reaction (Zamore et aL f 2000). Presumably, 
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dcnaturation followed by renaturation favoured the for- 
mation of the thermodynamically more stable, blunt- 
ended, but less active, siRNA duplexes. Isolation of 
siRNAs under native conditions does not reduce siRNA 
activity (Nykanen et aL, 2001). 

Production of siRNAs from long dsRNA requires the 
RNase III enzyme dicer (Bernstein et aL, 2001). Dicer is a 
bidentate RNase III, which also contains an ATP-depend- 
ent RNA helicase domain and a PAZ domain, presumably 
important for dsRNA unwinding and mediation of 
protein-protein interactions, respectively (Cerutti et aL, 
2000; Bernstein et aL, 2001). Dicer is evolutionarily 
conserved in worms, flies, plants, fungi and mammals 
(Matsuda et aL, 2000), and has a second cellular function 
important for the development of these organisms (Ray 
et aL, 1996; Jacobsen et aL, 1999; Grishok et aL, 2001; 
Hutvagner et aL, 2001; Knight and Bass, 2001). At 
present, it is uncertain whether dicer activity in species 
other than D.melanogaster produces siRNAs of predom- 
inantly 21 nt in length. The estimates of siRNA size vary in 
the literature between 21 and 25 nt (Hamilton and 
Baulcombe, 1999; Hammond et aL, 2000; Hutvagner 
etaL, 2000; Parrish et aL, 2000; Yang etaL, 2000; Zamore 
et aL, 2000; Elbashir et aL, 2001b). 

In a recent study of the effect of siRNA length in 
mammalian cells (primary mouse embryonic fibroblasts, 
293 and HeLa cells), duplexes of 21-27 nt siRNAs with 
2 nt 3' overhangs were directed against different co- 
transfected reporter genes (Caplen et aL, 2001). Duplexes 
of 22 and 23 nt siRNAs were found to be slightly more 
efficient in triggering sequence-specific gene silencing 
than 21 nt siRNA duplexes. In our hands, using the dual 
luciferase assay system in HeLa cells, 21 nt siRNA 
duplexes with 2 nt 3' overhang are 2- to 3-fold more 
efficient than 20 or 22-25 nt siRNA duplexes (data not 
shown), therefore recapitulating the results obtained from 
the D.melanogaster biochemical system. In contrast to the 
D.melanogaster system, siRNA duplexes >23 nt in length 
are still triggering some RNAi in HeLa cells and also in 
C.elegans (Caplen et aL, 2001). However, it remains to be 
determined whether the RNA strands finally incorporated 
into the active endonuclease complex are of the initially 
provided length. It is possible that exonucleases present in 
C.elegans and mammalian cells trim longer siRNAs to 
their optimum length and that these exonucleases are 
absent from D.melanogaster lysate. 

The functional anatomy of long dsRNAs as a trigger for 
RNAi was analysed previously in C.elegans (Parrish et aL, 
2000). Activation of RNAi by injection of long dsRNA 
requires at least two steps: dsRNA processing by dicer 
RNase HI and siRNP or RISC formation. Substitution of 
one of the strands of the long dsRNA by DNA abolished 
RNAi and even the substitution of C by dC or U by dT in 
only one of the strands caused a substantial decrease in 
RNAi. Because introduction of 2'-fluoro modifications 
into long RNA had no effect, it was suggested that an A- 
form double helical structure was important for triggering 
RNAi (Parrish et aL, 2000). We have been able to 
substitute eight ribose residues of a siRNA duplex by 2'- 
deoxyribose residues without substantial reduction of 
RNAi, although it should be noted the 2'-deoxy modifi- 
cations were clustered at the 3' end of the siRNAs, 
including the 2 nt 3' overhangs. It is possible that the four 



2'-deoxy modifications, which are located in the paired 
region at the end of the helix, do not affect the overall A- 
form helical structure and do not strongly compromise 
RISC formation. Complete modification of one or both 
siRNA strands by 2'-deoxyribose, however, abolished 
RNAi. Interestingly, substitution by 2'-0-methylribose, 
which adopts the ribose sugar pucker, also abolished 
RNAi, probably because methylation of the 2 / -hydroxyls 
blocked hydrogen bond formation or introduced steric 
hindrance. 

It was recently demonstrated that a 5'-phosphate on the 
target-complementary strand of a siRNA duplex is 
required for siRNA function and that ATP is used to 
maintain the 5'-phosphates of the siRNAs (Nykanen et aL, 
2001). However, 5'-phosphorylation of fully 2'-deoxy- or 
2'-0-methyl-modified siRNA strands was not able to 
restore RNAi (data not shown). Unmodified siRNA 
duplexes with free 5'-hydroxyls and 2 nt 3' overhangs 
are readily phosphorylated in D.melanogaster embryo 
lysate (Nykanen et aL, 2001). In this respect, it should be 
noted that our reported RNAi efficiencies were determined 
by pre-incubating the siRNA duplexes for 15 min in 
D.melanogaster lysate before adding target and control 
mRNAs, thus providing sufficient time for 5'-phosphoryl- 
ation of siRNA duplexes to occur. Comparison of the 
RNAi efficiencies of 5'-phosphorylated and 5'-non-phos- 
phorylated siRNAs (for duplexes shown in Figures IE, F 
and 2C) did not reveal any sizeable differences (data not 
shown). 

Conclusions 

We have performed an extensive analysis of the length, 
sequence and structure of siRNA duplexes in 
D.melanogaster embryo lysate. Duplexes of 21 nt 
siRNAs with 2 nt 3' overhangs were shown to be the 
most efficient triggers of RNAi-based mRNA degradation. 
The target recognition is a highly sequence-specific 
process, although not all positions of a guide siRNA 
contribute equally to specificity. These results are import- 
ant for the design of efficient siRNAs in order to silence 
genes in D.melanogaster and provide a basis for similar 
studies in other organisms. 



Materials and methods 

RNA preparation and RNAi assay 

Chemical RNA synthesis, annealing and luciferase-based RNAi assays 
were performed as described previously (Tuschl et at., 1999; Zamore 
et aL, 2000; Elbashir et aL, 2001b). Synthetic RNAs were gel purified 
after deprotection. The formation of siRNA duplexes was verified by 
agarose gel electrophoresis using 4% NuSieve GTG agarose (BMA, 
Rockland, ME) in 0.5 X TBE butter. All siRNA duplexes were directed 
against firefly luciferase and the luciferase mRNA sequence was derived 
from pGEM-luc (DDBJ/EM BL/GeoBank accession No. X65316) as 
described (Tuschl et aL, 1999). The siRNA duplexes were incubated in a 
D.melanogaster RNAi/translalion reaction for 15 min prior to addition of 
mRNAs. Translation-based RNAi assays were performed at least in 
triplicate. 

For mapping of sense target RNA cleavage, a 177 nt transcript was 
generated, corresponding to the firefly luciferase sequence between 
positions 1 13 and 273 relative to the start codon, followed by the 17 nt 
complement of the SP6 promoter sequence (Elbashir et aL, 2001b). For 
mapping of an ti sense target RNA cleavage, a 166 at transcript was 
produced from a template, which was amplified from plasmid sequence 
by PCR using the 5' primer TAATACGACTCACTATAGAG CCCATA- 
TCGTTTCATA (T7 promoter underlined) and 3' primer AGAG 
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* G ATGG AACCGCTGG . The target sequence corresponds to the com- 
plement of the firefly luciferase sequence between positions 50 and 215 
relative to the start codon. Guanyly! transferase labelling was performed 
as described previously (Zamore et ai, 2000). For mapping of target RNA 
cleavage. 100 oM si RNA duplex, was incubated with 5-10 nM target 
RNA in O.melanogaster embryo lysate under standard conditions 
(Zamore et ai, 2000) for 2 h at 25°C. The reaction was stopped by the 
addition of 8 vols of proteinase K buffer [200 mM Tris-HCI pH 7.5, 
25 mM EDTA, 300 mM NaCI, 2% (w/v) SOS]. Proteinase K (dissolved in 
water; Merck) was added to a final concentration of 0.6 mg/ml. The 
reactions were then incubated for 15 min at 65°C. extracted with phenol/ 
chloroform/isoamyl alcohol (25:24:1) and precipitated with 3 vols of 
ethanol. Samples were loaded on 6% sequencing gels. Length standards 
were generated by partial RNase Tl digestion and partial base hydrolysis 
of the cap-labelled sense or antisense target RNAs. 
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ogy). respectively. Staining specificity was controlled by single staining, as well as by using 
secondary antibodies in the absence of the primary stain. 

Generation of target cells 

Target cells displaying a membrane-integral version of either wild-type HELor a mutant 10 
exhibiting reduced affinity for HyHELIO ((R JI , D ,0 \ G 10 \ N' 0, | designated HEL*) were 
generated by transfecting mouse J558L plasmacytoma cells with constructs analogous to 
those used 10 for expression of soluble HEUHEL', except that 14 Ser/Glycodons, the H2K b 
transmembrane region, and a 23-codon cytoplasmic domain were inserted immediately 
upstream of the termination codon by polymerase chain reaction. For mHEL-GFP, we 
included the EGFP coding domain in the Ser/Gly linker. Abundance of surface HEL was 
monitored by flow cytometry and radio labelled -antibody binding using HyHEL5 and 
DI.3 HEL-specific monoclonal antibodies, for which the mutant HELs used in this work 
show unaltered affinities 10 . 

Interaction assays 

For D-cell/target interaction assays, splenic B cells from 3-83 or MD4 transgenic mice 2 " 11 
carrying (IgM + IgD) OCRs specific for HEL or H2K k /H2K b werc freshly purified on 
Lympholyic and incubated with a twofold excess of target cells in RPMI. 50 mM HEPES 
pH 7.4, for the appropriate time at 37 W C before being applied to polylysinc-coated slides. 
Cells were fixed in 4% paraformaldehyde/? BS or methanol and permeabilized with PBS/ 

0. 1. Triton X-100 before immunofluorescence. We acquired confocal images using a 
Nikon E800 microscope attached to BioRad Radiance Plus scanning system equipped with 
488-nm and 543-nm lasers, as well as differential interference contrast for transmitted 
light. GFP fluorescence in living cells in real time was visualized using a Radiance 2000 and 
Nikon E300 inverted microscope. Images were processed using BioRad Lasersharp 1024 or 
2000 software to provide single pISne images, confocal projections or slicing. 

Antigen presentation 

Presentation of HEL epitopes to T-cell hybridomas 2G7 (specific for l-l£ fc (HEL'~" , |) and 
IE5 (specific for l-E ,, |HEL ,( " u, "'|) by transfectants of the LK35.2 B-cetl hybridoma 
expressing an HEL-specific IgM BCR was monitored as described 10 . 

Received 12 December 2000; accepted 30 March 2001. 

1. Lauiavccchia. A. Anligcn-jpecific interaction between T and D cells. Nature Mi, 537-539 (1985). 

2. Klaus. G. G.. Humphrey, I. H.. Kunkl, A. & Dongworth, D. W. The follicular dendritic cell: its role in 
antigen presentation in the generation of immunological memory. Immunol Rev. 53, 3-28 (1980). 

3. Tew, I G.. Kosco, M. H.. Burton. G. F. & Szakal. A. K. Follicular dendritic cells as accessory cells. 
Immunol Rev. 117. 185-211 (1990). 

4. Kosco- Vilbois, M. H., Gray, D.. Scheidcgger. D. 8t lulius, M. Follicular dendritic cells help resting B 
cells to become effective antigen-presenting cells: induction of B7/BBI and upregulation of major 
histocompatibility complex class II molecules. /. Exp. Med. 178, 2055-2066 ( 1993). 

5. Schamd, W. W. 8c Rclh. M. Monomcric and oligomcric complexes of the B cell antigen receptor. 
Immunity 13, S- 1 4 (2000). 

6. Taylor. R. B., Dutfus, W. P. H., Raff. M. C. 6V dc Pctris. S. Redistribution and pinoc)1osis of lymphocyte 
surface immunoglobulin molecules induced by anti-immunoglobulin antibody. Nature 233, 225-227 
(1971). 

7. Schreincr, G. F. 8c Unanuc. E. R. Capping and the lymphocyte: models for membrane reorganisation. 
/. Immunol. 119, 15-19-1551 (1977). 

8. Cheng. P. G, Dykslra. M. L, Mitchell. R. N. 8c Pierce, S. K. A role for lipid rafts in B cell antigen 
receptor signaling and antigen targeting. /. Exp. Med. 190, 1549-1560 ( 1999). 

9. Weintrauh, B. C. et at. Entry of B cell receptor into signaling domains is inhibited in tolerant 0 cells. 
/. Exp. Med. 191. 1443-1448 (2000). 

10. Batista, F. D. & Ncubcrger, M. S. Affinity dependence ol the B cell response to antigen: a threshold, a 
ceiling, and the importance of off-rate. Immunity 8. 751-759 (1998). 

1 1. Nemazce. D. & Burki, K. Clonal deletion of B lymphocytes in a transgenic mouse bearing anti-MHC 
cbs$ I antibody genes. Nature 337. 562-566 (1969). 

12. Hartley. S. B. et at. Elimination from peripheral lymphoid tissues of self-reactive B lymphocytes 
recognizing membrane-bound antigens. Nature 353, 765-769 (1991 ). 

1 3. Dustin. M. L et at. Low affinity interaction of human and rat T cell adhesion molecule CD2 with its 
ligands aligns adhering membranes to achieve high physiological affinity. /. Biol. Chan. 272, 30889- 
30898(1997). 

14. Lang, |. et al B cells arc exquisitely sensitive to central tolerance and receptor editing by ultralow 
alfinity. membrane-bound antigen. /. Exp. Med. 184, 1685-1697 (1996). 

1 5. Valitutti. S.. Mullcr, S.. Cella. M„ Padovan, E. & Unzavecchia. A. Serial triggering of many T-cell 
receptors by a few pep tidc-M 1 1C complexes. Nature 375. 148-151 (1995). 

16. Monks, C R.. Freiberg, D. A., Kupfer. H.. Sciaky. N. 8c Kupfcr. A, Three-dimensional segregation of 
supramolccular activation clusters in Tcells. Nature 395, 82-86 (1998). 

17. Wulfing. C. 8t Davis. M. M. A rccepior/cyloskclcial movement triggered by costimulation during T 
cell activation. Science 182, 2266-2269 (1998). 

18. Grakoui. A. et al. The immunological synapse: a molecular machine controlling T cell activation. 
Science 285. 221-227 (1999). 

19. Uuptn. O.. Zaru. R.. Laroche. T. Muller. S. 8c Valitutti, S. Exclusion of CD45 from the T-cell receptor 
signaling area in antigen -stimulated T lymphocytes. Curt. Biol. 10, 277-280 (2000). 

20. Cagan. R. U. Kramer, H.. Hart, A. C & Zipursky. S. L The bride of sevenlcss and sevenlew interaction: 
internalization of a transmembrane ligand. CW/69, 393-399 (1992). 

21. Huang, I F. <f al. TCR-mediated internalization of peplide-MHC complexes acquired by T cells. 
Science 286, 952-954 (1999). 

22. Hwang. I. et al. T cells can use either T cell receptor or CD28 receptors to absorb and internalize cell 
surface molecules derived from antigen-presenting cells, /. Exp. Med. 191, 1 137-1 148 (2000). 

23. Batista. F. D. 8t N'euberger. M. S. B cells extract and present immobilized antigen: implications for 
affinity discrimination. EMBO f. 19. 513-520 (2000). 

24. Castcn. L A., Lakey, E. K.. Jdachich, M. L. Margoliash. E. 8c Pierce. S. K. Anti-immunoglobulin 



augments the B-cetl antigen-presentation function independently of internalization of receptor- 
antigen complex. Proc. Natl Acad. Sci. USA 82. 5890-5894 ( 1985). 

25. Siemasko, K.. Eisfddcr, B. |.. Williamson. E-. Kabak. S. 8c dark. M. R. Signals from the B lymphoqne 
antigen receptor regulate MHC class (I containing late endosomes. /. Immunol. 160, 5203-5208 
(1998). 

26. Smt, K. ef al. Efficient presentation of multivalent antigens targeted to various cell surface molecules 
of dendritic cells and surface Ig of antigen -specific 0 cells. /. Immunol 161, 6059-6067 (1998). 

27. Green. S. M, Lowe, A. D.. Partington, J. 6c Karn, |. Transformation of growth factor-dependent 
myeloid stem cells with retroviral vectors carrying c-myc. Oncogene 737-751 (1989). 

28. Russell, D. M. et al Peripheral deletion of self-reactive B cells. Nature 354. 308-3 II ( 1991 ). 

29. Goodnow, C C. et al. Altered immunoglobulin expression and functional silencing of self-reactive B 
lymphocytes in transgenic mice. Nature 334, 676-682 (1988). 

30. Aluvihare. V. R.. Khamlichi, A. A.. Williams, C. T. Adorini. L & Neuberger, M. S. Acceleration of 
intracellular targeting of antigen by the B-ccll antigen receptor: importance depends on the nature of 
the antigen-antibody interaction. EMBO J. 16. 3553-3562 ( 1997). 

Supplementary information is available on Natures World-Wide Web site 
(http://wwv.naiure.com) or as paper copy from the London editorial office of Nature. 

Acknowledgements 

We thank D. Amos and S. Reichelt for help and advice with confocal microscopy, and 
S. Munro for helpful discussions. We are indebted those who provided antibodies, 
transgenic mice and cell lines. F.D.B. and D.I. were supported by the Arthritis Research 
Campaign and Studienstiftung des deutschen Volkcs. respectively. 

Correspondence and requests for materials should be addressed to F.D.B. 
(c-mail:fdb@mrc-lmb.cam.ac.uk) or M.S.N. (e-mail:msn<5>rnrc-lmb.carn.ac.uk) 



Duplexes of 21 -nucleotide RNAs 
mediate RNA interference in 
cultured mammalian cells 

Sayda M. Elbashir , Jens Harbortht, Winfried lendeckel*, 
Abdullah YalcifT, Klaus Webert & Thomas Tuschl* 

* Department of Cellular Biochemistry; and t Department of Biochemistry and 
Cell Biology, Max-Planck-Institutc for Biophysical Chemistry, Am Fassberg 11, 
D-37077 Gbttingcn, Germany 



RNA interference (RNAi) is the process of sequence-specific, 
post-transcriptional gene silencing in animals and plants, 
initiated by double-stranded RNA (dsRNA) that is homologous 
in sequence to the silenced gene 1 " 4 . The mediators of sequence- 
specific messenger RNA degradation are 21- and 22-nucleotide 
small interfering RNAs (siRNAs) generated by ribonuclease III 
cleavage from longer dsRNAs 5 " 9 . Here we show that 2 1 -nucleo- 
tide siRNA duplexes specifically suppress expression of endo- 
genous and heterologous genes in different mammalian cell 
lines, including human embryonic kidney (293) and HeLa 
cells. Therefore, 21 -nucleotide siRNA duplexes provide a new 
tool for studying gene function in mammalian cells and may 
eventually be used as gene-specific therapeutics. 

Uptake of dsRNA by insect cell lines has previously been shown to 
'knock-down* the expression of specific proteins, owing to 
sequence-specific, dsRNA-mediated mRNA degradation 6,10 " 12 . 
However, it has not been possible to detect potent and specific 
RNA interference in commonly used mammalian cell culture 
systems, including 293 (human embryonic kidney), NIH/3T3 
(mouse fibroblast), BHK-21 (Syrian baby hamster kidney), and 
CHO-K 1 (Chinese hamster ovary) cells, applying dsRNA that varies 
in size between 38 and 1,662 base pairs (bp) 10,12 . This apparent lack 
of RNAi in mammalian cell culture was unexpected, because RNAi 
exists in mouse oocytes and early embryos 13 * 14 , and because RNAi - 
related, transgenc-mediated co -suppression was also observed in 
cultured Rat- 1 fibroblasts' 5 . But it is known that dsRNA in the 
cytoplasm of mammalian cells can trigger profound physiological 
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uridine residues. The thymidine overhang was chosen because it 
reduces costs of RNA synthesis and may enhance nuclease resistance 
of siRNAs in the cell culture medium and within transfected cells. As 
in the Drosophila in vitro system (data not shown), substitution of 
uridine by thymidine in the 3' overhang was well tolerated in 
cultured mammalian cells (Fig. 2a, c, e, g and i), and the sequence 
of the overhang appears not to contribute to target recognition 9 . 

In co-transfection experiments, 25 nM siRNA duplexes were 
used (Figs 2 and 3; concentration is in respect to the final volume 
of tissue culture medium). Increasing the siRNA concentration to 
100 nM did not enhance the specific silencing effects, but started 
to affect transfection efficiencies, perhaps due to competition for 
liposome encapsulation between plasmid DNA and siRNA (data 
not shown). Decreasing the siRNA concentration to 1.5 nM did 
not reduce the specific silencing effect (data not shown), even 
though the siRNAs were now only 2- to 20-fold more concen- 
trated than the DNA plasmids; the silencing effect only vanishes 
completely if the siRNA concentration was dropped below 
0.05 nM. This indicates that siRNAs are extraordinarily powerful 
reagents for mediating gene silencing, and that siRNAs are 
effective at concentrations that are several orders of magnitude 
below the concentrations applied in conventional antisense or 
ribozyme gene-targeting experiments 20 . 

To monitor the effect of longer dsRNAs on mammalian cells, 50- 
and 500-bp dsRNAs that arc cognate to the reporter genes were 
prepared. As a control for nonspecific inhibition, dsRNAs from 
humanized GFP (hG) 2f was used. In these experiments, the reporter 
plasmids were co-transfected with either 0.2 1 u.g siRNA duplexes or 
0.21 u.g longer dsRNAs. The siRNA duplexes only reduced the 
expression of their cognate reporter gene, while the longer 
dsRNAs strongly and nonspecifically reduced reporter-gene expres- 
sion. The effects are illustrated for HeLa S3 cells as a representative 
example (Fig. 3a and b). The absolute luciferase activities, were 
decreased nonspecifically 1 0- to 20-fold by 50-bp dsRNA, and 20- to 
200-fold by 500-bp dsRNA co-transfection, respectively. Similar 
nonspecific effects were observed for COS-7 and NIH/3T3 cells. For 
293 cells, a 10- to 20-fold nonspecific reduction was observed only 
for 500-bp dsRNAs. Nonspecific reduction in reporter-gene expres- 
sion by dsRNA>30bp was expected as part of the interferon 
response 16 . Interestingly, superimposed on the nonspecific inter- 
feron response, we detect additional sequence-specific, dsRNA- 
mediated silencing. The sequence-specific silencing effect of long 
dsRNAs, however, became apparent only when the relative reporter- 
gene activities were normalized to the hG dsRNA controls (Fig. 3c). 
Sequence-specific silencing by 50- or 500-bp dsRNAs reduced the 
targeted reporter-gene expression by an additional 2- to 5-fold. 
Similar effects were also detected in the other three mammalian cell 
lines tested (data not shown). Specific silencing effects with dsRNAs 
(356-1,662 bp) were previously reported in CHO-K1 cells, but the 
amounts of dsRNA required to detect a 2- to 4-fold specific 
reduction were about 20-fold higher than in our experiments' 1 . 
Also, CHO-K1 cells appear to be deficient in the interferon 
response. In another report, 293, NIH/3T3 and BHK-21 cells were 
tested for RNAi using Iuciferase/j3-galactosidase (lacZ) reporter 
combinations and 829-bp specific lacZ or 717-bp nonspecific 
green fluorescent protein (GFP) dsRNA 10 . The lack of detected 
RNAi in this case may be due to the less sensitive luciferase/lacZ 
reporter assay and the length differences of target and control 
dsRNA. Taken together, our results indicate that RNAi is active in 
mammalian cells, but that the silencing effect is difficult to detect if 
the interferon system is activated by dsRNA > 30 bp. 

To test for silencing of endogenous genes, we chose four genes 
coding for cytoskeletal proteins: lamin A/C, lamin Bl, nuclear 
mitotic apparatus protein (NuMA) and vimentin 17 . The selection 
was based on the availability of antibodies needed to quantitate the 
silencing effect. Silencing was monitored 40 to 45 h after transfec- 
tion to allow for turnover of the protein of the targeted genes. As 



shown in Fig. 4, the expression of lamin A/C was specifically 
reduced by the cognate siRNA duplex (Fig. 4a), but not when 
nonspecific siRNA directed against firefly luciferase (Fig. 4b) or 
buffer (Fig. 4c) was used. The expression of a non-targeted gene, 
NuMA, was unaffected in all treated cells (Fig. 4d-f), demonstrat- 
ing the integrity of the targeted cells. The reduction in lamin A/C 
proteins was more than 90% complete as quantified by western 
blotting (Fig. 4j, k). We note that lamin A/C 'knock-out* mice are 
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figure 3 Effects of 21 - nucleotide siRNAs, 50-bp, and 500-bp dsRNAs on luciferase 
expression in HeLa cells. The exact length of the long dsRNAs in base pairs is indicated 
below the bars. Experiments were performed with pGL2-Control and pRL-TK reporter 
plasmids. The data were averaged from two independent experiments ± s.d. a, Absolute 
Pp-luc expression, plotted in arbitrary luminescence units (a.u.). b, flr-luc expression, 
plotted in arbitrary luminescence units, c, Ratios of normalized target to control luciferase. 
The ratios of luciferase activity for siRNA duplexes were normalized to a buffer control (Bu, 
black bars); the luminescence ratios for 50- or 500-bp dsRNAs were normalized to the 
respective ratios observed for 50- and 500-bp dsRNAs from humanized GFP (hG, black 
bars). We note that the overall differences in sequence between the 49- and 484 -bp GL2 
and GL3 dsRNAs are not sufficient to confer specificity for targeting GL2 and GL3 targets 
(43-nucleotide uninterrupted identity in 49-bp segment, 239-nudeotide longest 
uninterrupted Identity in 484 -bp segment) 30 . 
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viable for a few weeks after birth 23 and that the lamin A/C knock- 
down in cultured cells was not expected to cause cell death. Lamin A 
and C are produced by alternative splicing in the 3' region and 
arc present in equal amounts in the lamina of mammalian cells 
(Fig. 4j, k). Transfection of siRNA duplexes targeting lamin Bl and 
NuMA reduced the expression of these proteins to low levels (data 
not shown), but we were not able to observe a reduction in vimentin 
expression. This could be due to the high abundance of vimentin in 
the cells (several per cent of total cell mass) or because the siRNA 
sequence chosen was not optimal for targeting of vimentin. 

The mechanism of the 21 -nucleotide siRNA-mediated interfer- 
ence process in mammalian cells remains to be uncovered, and 
silencing might occur post-transcriptionally and/or transcription- 
ally. In Drosophila lysate, siRNA duplexes mediate post-transcrip- 
tional gene silencing by reconstitution of siRNA-protein complexes 
(siRNPs), which guide mRNA recognition and targeted cleavage <,,7,s . 
In plants, dsRNA-mediated post-transcriptional silencing has also 
been linked to DNA mcthylation, which may also be directed by 2 1 - 
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Figure 4 Silencing of nuclear envelope proteins lamin A/C in HeLa cells. Triple 
fluorescence staining of cells transfected with lamin A/C siRNA duplex (a, d, g). with GL2 
luciferase siRNA duplex (nonspecific siRNA control) (b. e, h), and with buffer only (c, f, i). 
a-c, Staining with lamin A/C specific antibody; d-f, staining with NuMA-specific 
antibody; g-i, Hoechst staining of nuclear chromatin. Bright fluorescent nuclei in a 
represent untransfected cells, j, k, Western blots of transfected cells using lamin A/C- { j) 
or vimentin-specific (k) antibodies. The Western blot was stripped and re-probed with 
vimentin antibody to check for equal loading of total protein. 



nucleotide siRNAs 24 . Methylation of promoter regions can lead to 
transcriptional silencing", but methylation in coding sequences 
does not 26 . DNA methylation and transcriptional silencing in 
mammals are well documented processes 27 , yet their mechanisms 
have not been linked to that of post-transcriptional silencing. 
Methylation in mammals is predominantly directed towards CpG 
dinudeotide sequences. There is no CpG sequence in the RL or 
lamin A/C siRNA, although both siRNAs mediate specific silencing 
in mammalian cell culture, so it is unlikely that DNA methylation is 
essential for the silencing process. 

Thus we have shown, for the first time, siRNA-mediated gene 
silencing in mammalian cells. The use of exogenous 21 -nucleotide 
siRNAs holds great promise for analysis of gene function in human 
cell culture and the development of gene-specific therapeutics. It 
will also be of interest in understanding the potential role 
of endogenous siRNAs in the regulation of mammalian gene 
function, □ 

Methods 

RNA preparation 

21 -nucleotide RNAs were chemically synthesized using Expedite RNA phosphoramiditcs 
and thymidine phosphoramidite (Proligo, Germany). Synthetic oligonucleotides were 
deprotected and gel-purified'*. The accession numbers given below are from GenBank. The 
siRNA sequences targeting GL2 (Acc. No. X65324) and GL3 luciferase (Acc. No. U47296) 
corresponded to the coding regions 153-173 relative to the first nucleotide of the start 
codon; siRNAs targeting RL (Acc. No. AF025846) corresponded to region 1 19-139 after 
the start codon. The siRNA sequence targeting lamin A/C (Acc. No. X03444) was from 
position 608-630 relative to the start codon; lamin Bl (Acc. No. NM_0O5573) siRNA was 
from position 672-694; NuMA (Acc. No. Zl 1583) siRNA from position 3,988-4,010. and 
vimentin (Acc. No. NM_O0338O) from position 346-368 relative to the start codon. 
Longer RNAs were transcribed with T7 RNA polymerase from polymerase chain reaction 
(PCR) products, followed by gel purification. The 49- and 484-bp GL2 or GL3 dsRNAs 
corresponded to positions 1 13-161 and 1 13-596. respectively, relative to the start of 
translation; the 50- and 501 -bp RLdsRNAs corresponded to position 1 18-167 and 1 18- 
618, respectively. PCR templates for dsRNA synthesis targeting humanized GFP (hG) were 
amplified from pAD3 (ref. 21 ), whereby 50- and 50l-bp hG dsRNA corresponded to 
positions 121-170 and 121-621, respectively, to the start codon. 

For annealing of siRNAs, 20u-M single strands were incubated in annealing buffer 
(lOOmM potassium acetate, 30 mM HEPES-KOII at pH 7.4. 2mM magnesium acetate) 
for I min at 90 °C followed by I h at 37 °C. The 37 °C incubation step was extended 
overnight for the 50- and 500-bp dsRNAs. and these annealing reactions were performed 
at 8.4 |lM and 0.84 \iM strand concentrations, respectively. 

Cell culture 

S2 cells were propagated in Schneider's Drosophila medium (Life Technologies) supple- 
mented with 10% fetal bovine serum (FDS) 100 units ml" 1 penicillin, and lOOp-gml" 1 
streptomycin at 25 °C. 293. NIH/3T3, HeLa S3, HeLaSS6. COS-7 cells were grown a( 37 °C 
in Dulbecco's modified Eagle's medium supplemented with 10% FDS, 100 units ml" 1 
penicillin, and lOOp-giul - ' streptomycin. Cells were regularly passaged to maintain 
exponential growth. Twenty-four h before transfection at 50-80% conflucncy, mamma- 
lian cells were trypsinized and diluted 1:5 with fresh medium without antibiotics 
(1-3 X 10' cells ml"') and transferred to 24-well plates (500 u.1 per well). S2 cells were not 
trypsinized before splitting. Co- transfection of reporter plasm ids and siRNAs was carried 
out with Lipofectamine 2000 (Life Technologies) as described by the manufacturer for 
adherent cell lines. Per well. 1.0 p.g pGL2 -Control (Promega) or pGL3-Control 
(Promega).O.I u.gpRL-TK (Promega), and 0.21 u.g siRNA duplex or dsRNA, formulated 
into liposomes, were applied; the final volume was 600 p.1 per well. Cells were incubated 
20 h after transfection and appeared healthy thereafter. Luciferase expression was 
subsequently monitored with the Dual luciferase assay (Promega). Transfection 
efficiencies were determined by fluorescence microscopy for mammalian cells lines after 
co-transfection of 1.1 p.g hGFP-encoding pAD3 (ref. 21 ) and 0.21 u.g inverted GL2 siRNA. 
and were 70-90%. Reporter plasmids were amplified in XL-1 Blue (Stratagene) and 
purified using the Qiagen EndoFrec Maxi Plasmid Kit. 

Transfection of siRNAs for targeting endogenous genes was carried out using 
Oligofectamine (Life Technologies) and 0.84 u.g siRNA duplex per well, but it was recently 
found that as little as 0.01 u-g siRNAs per well are sufficient to mediate silencing. HeLa SS6 
cells were transfected one to three times in approximately 15 h intervals and were assayed 
40 to 45 h after the first transfection. It appears, however, that a single transfection is as 
efficient as multiple transfections. Transfection efficiencies as determined by immuno- 
fluorescence of targeted cells were in the range of 90%. Specific silencing of targeted genes 
was confirmed by at least three independent experiments. 

Western blotting and immunofluorescence microscopy 

Monoclonal 636 lamin A/C specific antibody^ was used as undiluted hybridoma super- 
natant for immunofluorescence and 1/100 dilution for western blotting. Affinity-purified 
polyclonal NuMA protein 705 antibody"** was used at a concentration of lOpLgml" 1 for 
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A tetracycline-con trolled trans repressor protein has been engineered to silence transcriptional activities of 
eukaryotic promoters that are stably integrated into the chromatin of human cells. By fusing the KRAB domain 
of human Koxl to the Tel repressor derived from Tn/0 of Escherichia coli, a tetracycline-con trolled hybrid 
protein (TetR-KRAB) was generated and constitutively expressed in HeLa cells. The TctR- KRAB protein binds 
to let operator (tetO) sequences in the absence but not in the presence of tetracycline When TetR-KRAB bound 
to tetO sequences upstream of the immediate-early promoter-enhancer of human cytomegalovirus (CMV), the 
expression of a CMV-driven luciferase reporter construct (ptet07-CMV-L) was repressed in transient trans- 
fection experiments. This silencing was found to operate on different promoters and from tetO sequences placed 
more than 3 kb from the transcriptional start site. We constructed a stable, doubly transfected cell line 
(TIS-10) carrying a chromosomaliy integrated ptet07-CMV-L reporter construct and expressing the TetR- 
KRAB protein. Upon addition of tetracycline, luciferase expression was induced more than 50-fold above the 
baseline level, with half-maximal induction by 2 days. Furthermore, a protein of around 110 kDa was found to 
coimmunoprecipitate with the TetR-KRAB fusion protein. This protein might play a role as an adaptor protein 
mediating the silencing exerted by the TetR-KRAB protein. The TetR-KRAB silencing system should be useful 
as a genetic switch for regulating the expression of chromosomaliy integrated heterologous and endogenous 
genes present in mammalian genomes. 



Inducible gene expression has been a valuable tool for the 
study of gene function in bacteria, yeasts, and Drosophila mela- 
nogaster. In mammalian cells, eukaryotic promoter systems 
that respond to inducing agents such as glucocorticoid hor- 
mone (19, 23), heat shock (40), heavy metal ions (28), or 
interferon (32) have been used. To circumvent limitations due 
either to the leakiness of utilized promoters or the pleiotropic 
effects of inducing agents, chimeric transcription factors have 
been generated, for example, by fusing the strong transcrip- 
tional activator domain of VP 16 to the TnJ0-derived prokary- 
otic tetracycline repressor (TetR) protein (16). In general, 
chimeric transcription factors of this kind, TetR-VP16 (16), 
GAL4-VP16 (33), LexA-VP16 (7), and LacR-VP16 (22), are 
targeted to minimal promoters which are fused to multiple 
DNA binding sites specific for the factor in question. This 
results in marked transcriptional stimulation of the respective 
minimal promoter. In order to modulate the transcriptional 
activity of chimeric transcription factors, movable steroid bind- 
ing domains originating from the glucocorticoid receptor (30) 
or the estrogen receptor (38) have been fused to sequence- 
specific DNA binding domains. In the case of a GAL4-VP16- 
estrogen receptor fusion, estrogen administration has been 
shown to produce a functionally active chimeric transcription 
factor that activates a target promoter (6). 

In the tetracycline system, the TetR protein fused to the 
transactivating domain of VP16 (called tTA) has been shown 
to bind to and strongly activate minimal promoter systems 
containing seven tetracycline operator (tetO) sequences (15, 
16). The binding of tTA to the tetO sequences is blocked by 
tetracycline, preventing the activation of the target promoter 
which generates a very tight genetic switch (16). 

In this report, we present a novel system for tetracycline- 
controlled silencing of eukaryotic promoters. Here, the KRAB 
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repressor domain of the human Koxl zinc finger protein (27) 
has been fused to the TetR protein. In the absence of tetracy- 
cline, this chimeric DNA-binding protein (TetR-KRAB) exerts 
its silencing activity by binding to several cw-acting tetO sites 
placed at a distance from the transcriptional initiation site of a 
eukaryotic promoter. Promoter activity is restored upon ad- 
ministration of tetracycline, which prevents binding of TetR- 
KRAB to the tetO sequences. We also present evidence that 
the TetR-KRAB forms a complex with a protein of around 110 
kDa. This protein, provisionally called SMP1 (silencing-medi- 
ating protein 1), might constitute a putative corepressor pro- 
tein that mediates the repression activity employed by the 
KRAB domain of Koxl. 

MATERIALS AND METHODS 

Plasmid constructions. The fusion between TetR and 121 amino acids of the 
NH 2 terminus of Koxl was constructed by PCR amplification. A PCR fragment 
was generated by using two synthetic oligonucleotides, 5 ' ATCAG O AATTC 
AA CCATGGC TAG ATT AG ATAAAAG3 ' and 5 GTCCT GTCG AC CTTTCTC 
TrCTITTTTGGCGACCCACriTCACAn'n*, and pUHD15-l (16), intro- 
ducing an Ncol site around the translational start of the TetR preceded by an 
EcoRl restriction site (sites are underlined). In addition, six amino acids com- 
prising the simian vims 40 (SV40) large-T-antigen nuclear localization sequence 
(20) (nucleotides in boldface) were fused in frame with the last amino acid 
residue of TetR followed by a Sail restriction site (underlined). The resulting 
PCR fragment was digested with EcoRl and Sail. A second PCR fragment was 
generated by using two oligonucleotides, 5 ' G AC AG GTCG AC GGCGGTGG T 
GCTTTGTC3' and 5 'GCTG CGG ATCC TTAAACTGATG ATTTG ATTTC3 ' . 
and plasmid pKoxl (35), introducing a Sail site at the NH 2 -terminal end of the 
KRAB domain of Koxl and a stop codon at position 121 of Koxl followed by a 
BamHl restriction site. After digestion with Sail and BamHl, both fragments 
were ligated into EcoRl- and BamHI-digested pUHDIO-l (12), generating 
pCMV-tetR-KRAB. To generate oCMV-tetR-KRAB-byg, a HindRl-Bsml frag- 
ment of plasmid p220.3, a derivative of p22Q.l (13) (kindly provided by M. 
Gossen and H. Bujard) containing the hygromycin resistance gene under control 
of the herpes simplex virus (HSV) thymidine kinase (TK) promoter, was intro- 
duced into a Pvull site downstream of the poryadenylalion signal in pCM V-tetR- 
KRAB. To generate pCMV-tetR, pCMV-tetR-KRAB was digested with Sail and 
BamHl, and the ends were filled in and religated. This retains the fusion of the 
complete TetR to the nuclear localization signal of SV40 large T antigen (Pro- 
Lys-Lys-Lys-Arg-Lys) followed by five amino acids originating from the vector 
backbone (Val-Glu-Ile-Gln-Thr-Stop). 

Plasmid ptet07-CMV-L was generated by Ugating the 7 86 -bp Xhol-BamVtt 
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cytomegalovirus (CMV) promoter fragment of pUHDICM (12) into Xho\- 
Bam HI -digested pUHC13-4 (16). This places seven tctO sequences just upstream 
of the human CMV immediate-early promoter/enhancer (4). Plasmid ptet07- 
TK-L was generated by ligating an Xhol-Bgfil fragment from pT1091uc (29) 
comprising the HSV TK promoter (-109 to +52, both GC boxes and the 
CCAAT box present) into Xho\ -Bam HI -d igested pUHC13-4 (16). 

Cell culture and transfections. HeLa cells were grown in Iscove's modified 
Duibecco medium (GIBCO) supplemented with 5% fetal calf serum (FCS; 
Boehringer Mannheim) in a 5% COj environment at 37°C. Cells were plated at 
a density of 10 5 cells per 35-mm-diameter dish and transfected at 80% conflu- 
ence with 5 u.g of plasmid DNA (4.5 jig of the expression and 0.5 p.g of the 
reporter plasmid) as a calcium phosphate precipitate (1). After the transfection, 
cells were further incubated in Iscove's modified Duibecco medium supple- 
mented with 5% FCS and no or 0.5 jig of tetracycline (Sigma) per ml for 24 h 
before determination of luciferase activities from cell extracts. Determination of 
(J-galactosidase activities (1) from cotransfectcd pCHllO (2 ^g per dish; Phar- 
macia) was used to control for transfection efficiencies. 

To establish stable cell lines expressing the TetR-KRAB fusion protein, HeLa 
cells were transfected with 2.5 p.g of pCMV-tetR-KRAB-hyg DNA linearized 
withAjp 700. Forty-eight hours after transfection, cells were trypsinized and split 
at a ratio of 1:10, and resistant colonies were selected in lscove modified Dui- 
becco medium supplemented with 5% FCS (Boehringer) and 250 jig of hygro- 
mycin B (Sigma) per ml. Resistant colonies were subcloned by limiting dilution 
in nonselective medium and tested for expression of functional TetR-KRAB 
protein, resulting in the A12 cell line. To establish a cell line containing the 
ptet07-CMV-L construct together with pC M V-tetR-K RAB, A12 cells were 
transfected with 5 |i.gof a 15:1 mixture of ptet07-CM V-L linearized with HindlU 
and pNE05' (26). Colonies resistant to G418 (500 ng/ml) were grown in pres- 
ence of 0.5 jig of tetracycline per ml, and luciferase activities were determined 
from extracts. One positive clone, called TIS-10, was chosen for further charac- 
terization. 

La rife rase assays. Cells were grown in absence or presence of tetracycline (0.5 
jig/ml) in 35-mm-diameter dishes, washed with 2 ml of phosphate-buffered saline 
(PBS), and then rysed in 500 p.1 of 20 mM Tris-HCl (pH 7^)-150 mM NaCl-1 
mM EDTA-1 mM dithiothreitol-0.6% Nonidet P-40. The tysate was collected 
and centrifuged in an Eppendorf centrifuge for 10 s. Aliquots (10 jaI) of the 
supernatants were mixed with 350 u-1 of 25 mM grycylglycine-15 mM MgS0 4 -5 
mM ATP and assayed for luciferase activity (14) in a Lumat LB9501 (Berthold, 
WQdbacL Germany), using the integral mode (10 s). D-Luciferin (L 6882; Sigma) 
was used at 1 mM. Protein content of the hysates was determined by the method 
of Bradford (5). 

Generation of Koxl antiserum. Recombinant Koxl protein was generated by 
using the T7 expression vector pAR3039 (31). Briefly, the BamYU cDNA frag- 
ment of Koxl (35) was cloned into the Bam HI site of pAR3039, expressed in the 
Escherichia coti host strain BL2l(DB3), and the Koxl protein was purified by 
Mono S fast protein liquid chromatography (36). Functionally active protein was 
obtained after renaturing Koxl in a diaryzing procedure adapted for zinc finger 
proteins (28a). One rabbit was immunized with renatured Koxl protein. 

Immunofluorescence. HeLa A12 cells were grown in tissue culture chamber 
slides (Miles Scientific), washed with PBS, fixed in 3% paraformaldehyde in PBS 
for 10 min, washed three times with PBS, and permeabilized with 0.6% Nonidet 
P-40 in PBS for 5 min. After washing with PBS, the fixed monolayer was blocked 
by incubation with 3% bovine serum albumin (BSA) in PBS for 20 min to reduce 
nonspecific antibody adsorption. Cells were then washed with PBS and incubated 
for 40 min with the rabbit anti-human Koxl immune serum diluted 1/500 in 3% 
BSA in PBS. After four washes with PBS, cells were incubated for 20 min in the 
dark with a 1/100 dilution of a fluoresce in -conjugated goat anti-rabbit immuno- 
globulin G (Boehringer Mannheim) in 3% PBS. Finally, cells were washed five 
times with PBS and covered with 4 ^1 of 90% grycerol-1 mM Kl-0.2 jig of 
4\6-d^midino-2-phenylindole (DAPI) per ml (for staining of the DNA). Cov- 
ers! ips were applied and sealed in place with clear nail polish. Cells were visu- 
alized by fluorescence microscopy with a Zeiss microscope at a magnification of 
1:200. 

ElectrophoreUc mobility shift assay. A 12 cells were grown in 35-mm-diameter 
dishes to 80% confluence, washed with PBS, and scraped off the plate, and the 
sedimented cells were resuspended in 200 pJ of a buffer containing 20 mM 
A^-2-hydroxye%lpiperazine-N'-2-ethanesulfonic acid (HEPES)-KOH (pH 7.9), 
25% glycerol, 400 mM NaCl, 1 mM EDTA, 1 mM EGTA, 0.1 mM Na 3 Va0 4 , 25 
mM ^-glycerophosphate, 1 mM dithiothreitol, and 05 mM phenylmethylsulfonyl 
fluoride. After freeze- thawing and incubation for 10 min on ice, the cellular 
debris was sedimented (10 min ( 13,000 X g). Aliquots of total cell extract (2 u>g 
of total protein) were mixed with 2 fmol of ^P-Iabeled tet0 2 DNA (16) in 25 p.1 
of binding buffer containing 10 mM HEPBS-KOH (pH 7.9), 1 mM EDTA, 4% 
Ficoll 400, and 4 u.g of sonicated calf thymus DNA. After 15 min at room 
temperature, the binding reaction mixture was loaded onto a 4% polyacryl- 
amide-0.2% bisacrylamide gel with 0J25x Tris-borate-EDTA (1) as the running 
buffer. Polyacrylamide gel electrophoresis (PAGE) was performed at room tem- 
perature at 7.5 V/cm. For assays in which antibodies were included, binding was 
allowed for 10 min before addition of Z5 uJ of a 1:10 dilution of serum in binding 
buffer, and incubation was continued for an additional 5 min at room tempera- 
ture. When deoxycholate (DOC) was used, appropriate dilutions of the DOC in 
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binding buffer were added after 10 min incubation in absence of DOC Dried gels 
were visualized by autoradiography using Kodak AR-5 film. 

I mmurw precipitation. HeLa and HeLa A12 cells were grown in 100-mm- 
diameter dishes to 80% confluence, washed, and grown for 2 h in 4 ml of 
L-melhionine-free modified Eagle's medium (GIBCO) supplemented with 5% 
diaryzed FCS and 500 p.Ci of L«{ 35 Sjmetbiooine (Amersham). Following the 
addition of 4 ml of Dulbecco's modified Eagle's medium (GIBCO) supple- 
mented with 5% FCS, cells were grown for an additional 30 min, washed with 
PBS, harvested, and rysed by freeze-thawing in 500 |U of immunoprecipitation 
buffer (IP buffer, 50 mM Tris-HCl [pH 7.4], 250 mM NaCl, 0.05% Nonidet P-40, 
5 mM EDTA, 50 mM NaF, 1 mM Na 3 Va0 4 , 0.5 mM phenylmethylsulfonyl 
fluoride, 2 mM dithiothreitol) followed by centrifugation for 10 min at 10,000 X 
g. The crude supernatant was adjusted to 1% BSA in IP buffer and precleared 
with 50 uJ of a 50% slurry of protein G-agarose beads (Boehringer). The super- 
natant was divided in two aliquots; to each, 2 }il of the anti-Koxl rabbit immune 
serum was added. After 2 h of incubation at 4°C, a slurry of 50 \l\ of protein 
G-agarose was added, and the mixture was incubated for an additional 30 min. 
The beads were collected and washed three times in IP buffer. To one aliquot of 
the beads, 50 jil of IP buffer containing 0.1% DOC was added; after 5 min at 
room temperature, the beads were separated from the supernatant. Fifty micro- 
liters of IP buffer was added to the beads, and the samples were adjusted to 
SDS-sample buffer and separated on an SDS-12% polyacrylamide gel (1). The 
gel was soaked in Amplify (Amersham), and the labeled proteins were visualized 
by autoradiography. 

RESULTS 

Construction of expression plasmids. It was recently shown 
that a fusion protein of the yeast GAL4 DNA binding domain 
and the KRAB domain of Koxl expressed in mammalian cells 
strongly repressed a cotransfected chloramphenicol acetyl- 
transferase reporter construct containing GAL4 binding sites 
upstream of the TK promoter (27). We decided to take ad- 
vantage of this finding. We fused the first NH 2 -terminal 121 
amino acid residues of Koxl, containing the KRAB domain 
(35), to the C terminus of the prokaryotic TetR encoded by 
Tn70 from E. coli (16) to create a tetracycline-responsive tran- 
scriptional silencer protein, TetR-KRAB. As a linker between 
the two domains, six amino acids comprising the nuclear lo- 
calization sequence of SV40 large T antigen (20) were added 
to favor nuclear localization of the TetR-KRAB fusion protein 
in mammalian cells. In plasmids used for expression of TetR- 
KRAB (pCMV-tetR-KRAB and pCMV-tetR-KRAB-hyg) in 
mammalian cells, the sequence encoding TetR-KRAB is 
flanked upstream by the human CMV immediate-early pro- 
moter and downstream by the S V40 po!y(A) site. As a control, 
the KRAB domain was removed, creating TetR fused to the 
nuclear localization signal followed by five additional amino 
acids in pCMV-tetR The reporter plasmids (ptet07-CMV-L 
and ptet07-TK-L) are based on pUIIC13-4 (16). In these 
constructs, the CMV or HSV TK promoter was inserted down- 
stream of seven tetO sequences and upstream of the luciferase 
as a reporter gene. The right boundary of the fragment con- 
taining the seven tetO sites is located more than 685 and 109 bp 
upstream of the transcriptional initiation site of the CMV and 
TK promoters, respectively. Relevant features of plasmids 
pCMV-tetR-KRAB, pCMV-tetR, and ptet07-CMV-L are 
outlined in Fig. 1. The rationale is to generate a fusion protein 
TetR-KRAB that binds in cis to tetO sequences upstream of 
the transcriptional initiation site of the CMV promoter (Fig. 1) 
or to any other promoter. Binding of the KRAB domain to the 
tetO sites should result in active repression of productive tran- 
scription measured by luciferase activity. Addition of low levels 
of tetracycline to the growth medium should prevent TetR- 
KRAB from binding to the tetO sequences and should thereby 
restore full productive transcriptional initiation from the re- 
spective promoter. 

Repression of transcription mediated by the TetR-KRAB 
fusion protein. HeLa cells were cotransfected with the reporter 
plasmid ptet07-CMV-L and one of the expression plasmids 
pCMV, pCMV-tetR-KRAB, and pCMV-tetR and grown with 
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FIG. 1. Schematic representation of main features of expression and reporter constructs. (A) In the expression plastnid pCMV-tetR-KRAB, the TetR-KRAB 
sequence is placed downstream of the strong immediate-early human CM V promoter (Pcmv) and is followed by an SV40 poh/adenylation signal (An). The TetR-KRAB 
fusion protein is composed of the complete 207-amino-acid sequence of TetR fused to 6 amino acids representing the SV40 large-T-antigen nuclear localization 
sequence (NLS) and a 121-amino-acid KRAB domain originating from the KH 2 terminus of human Kaxl containing the KRAB A and B domains (27). The KRAB 
domain has been removed in the control expression plasm id pCM V-tetR. (B) The reporter plasmid ptet07-CMV-L consists of seven tctO sequences placed upstream 
of the promoter/enhancer sequence (positions -675 to +75, +1 being the transcriptional start site) of human CMV followed by sequences encoding firefly luciferase 
and an SV40 pory(A) site (An). The distances of the boundaries of the te/O-containing fragment (-994 and -685) with respect to the transcriptional start site of the 
CMV promoter and the sites for insertion of the promoter(s), X (Xhol) and B (BamHV), are indicated. 



or without tetracycline for 1 day (Fig. 2). In transfections using 
the control vector pCMV lacking TetR sequences, similar lu- 
ciferase activities were obtained from cells grown with or with- 
out tetracycline. In contrast, luciferase activity was reduced 
about 10-fold in transfections with pCMV-tetR-KRAB (Fig. 2, 
CMV-tRK). In these cells, luciferase expression was restored 
by addition of 0.5 u.g of tetracycline per ml to the medium. No 
repression was observed when TetR fused to the nuclear lo- 
calization sequence was expressed alone (Fig. 2, pCMV-tR), 
demonstrating that the repression is mediated by the KRAB 
domain.. Similar results were obtained for the reporter con- 
struct ptet07-TK-L carrying the HSV TK promoter (data not 
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FIG. 1 Tetracycline-reversible silencing of the CMV promoter by TetR- 
KRAB in transiently transfected HeLa cells. HeLa cells (70 to 80% confluent) 
grown in 35-mm-diameter dishes without tetracycline (let) were transiently trans- 
fected at a molar ration of 1:5 with the reporter plasmid ptet07-CMV-L (0.5 p.g) 
and 2.5 jtg of one of the expression plasmids pCM V (CMV), pC MV-tetR-KRAB 
(CMV-tRK), and pCMV-tetR (CMV-tR). After transfection, cells were grown 
for an additional 24 h with or without 0J5 jig of tetracycline per mJ before 
luciferase activities were determined from aliquots (1/50) of total cell extracts. 



shown). No differences in luciferase activities were observed 
for reporter constructs lacking tetO sequences (data not 
shown). This observation suggested that TetR-KRAB inter- 
fered with productive initiation at the CMV or TK promoter 
only when bound to tetO sequences. We next made stable 
t ran sfect ants expressing TetR-KRAB under the control of the 
CMV promoter. The construct used (pCMV-tetR-KRAB-hyg) 
contained a hygromycin resistance gene under the control of 
the TK promoter inserted into pCMV -tetR-KRAB (see Ma- 
terials and Methods). Clones resistant to hygromycin B were 
assayed for expression of the TetR-KRAB protein, and the 
intracellular localization of the TetR-KRAB hybrid protein 
was visualized by indirect immunofluorescence. In one of the 
stable cell lines examined, A12, the TetR-KRAB protein was 
localized predominantly to the nucleus by indirect immunoflu- 
orescence using an anti-Koxl immune serum (Fig. 3d and h). 
The staining was specific since the preimmune serum showed 
very little staining (Fig. 3b and f). A12 cells (Fig. 3d) showed a 
much stronger nuclear staining than the parental HeLa cells 
(Fig. 3h), as a result of the expression of the TetR-KRAB 
protein. The staining of HeLa nuclei (Fig. 3h) suggests that 
nuclear proteins that are immunologically related or identical 
to Koxl are expressed in HeLa cells. Koxl mRNA has been 
detected in HeLa cells (35). 

The luciferase activities from HeLa A 12 cells transiently 
transfected with plasmids pCMV-L and pTK-L, which lack 
tetO sequences, were not altered by the presence of tetracy- 
cline in the medium (Fig. 4). However, tetO sequences inserted 
in ptet07-CMV-L and ptet07-TK-L resulted in a significant 
reduction of luciferase activities in the absence of tetracycline 
of more than 1 order of magnitude (Fig. 4). Addition of tet- 
racycline fully restored the expression of luciferase (Fig. 4). 
Luciferase activities of these plasmids transfected into parental 
HeLa cells that lack stably expressed TetR-KRAB protein 
were not affected by tetracycline (data not shown). Increasing 
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FIG. 3. Expression and nuclear localization of TetR-KRAB fusion protein in 
HeLa A12 cells. HeLa cells were stably transfected with pCMV-TetR-KRAB- 
hyg, and stable clones were selected in hygromycin B. The right-hand row of 
photomicrographs shows indirect immunofluorescence with an anti-Kaxl im- 
mune serum of stable transfected HeLa A 12 (d) and HeLa (h) cells. A preim- 
mune serum was used as a control for A12 (b) and HeLa (f) cells. Fluoresce! n- 
conjugated goat anti-rabbit antibodies were used as secondary antibodies. The 
DNAs of the respective cells were stained with DAPI to visualize the nuclear 
compartments and are shown in panels a, c, e, and g. 



the distance of the tetO sequences to the transcriptional start 
site to about 3,600 bp still resulted in significant tetracycline- 
reversible repression of luciferase expression by TetR-KRAB 
which was just threefold lower than that at a distance of 685 bp 
(13a). Furthermore, the orientation of the te/O-containing 
fragment with regard to the promoter had no influence on the 
observed repression (13a). All of the many different promoters 
tested so far in this system were found to respond to TetR- 
KRAB-mediated silencing equally well (data not shown). We 
conclude that TetR-KRAB stably produced in HeLa cells is 
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FIG. 4. Tetracycline-reversible silencing of the CMV and HS V TK promot- 
ers in HeLa A12 cells stably transfected with pCMV-tetR-KRAB. HeLa A12 
cells (70 to 80% confluent) grown in 35- mm-dia meter dishes without tetracycline 
(tet) were transiently transfected in duplicate with, per dish, 2.5 pig of pCM V-L 
or ptet07-CMV-L (A) or 2.5 jig of pTK-L or ptet07-TK-L (B). After transfec- 
tion, the cells were grown in the presence or absence of 0.5 jig of tetracycline per 
ml in the medium for an additional 24 h before luciferase activities were deter- 
mined from 1/50 of total extracts of transfected cells. Two independent experi- 
ments are shown for ptet07-CMV-L (A) and ptet07-TK-L (B). 



able to actively repress the productive expression from both 
the CMV and TK promoters by binding to ciy-acting tetO 
sequences placed at a distance from the respective transcrip- 
tional initiation sites. 

Reversible repression of the ptet07-CMV-L construct stably 
integrated into the genome of HeLa A 12 cells. To quantify the 
tetracycline-inducible repression of the CMV promoter, the 
construct ptet07-CMV-L was stably integrated into HeLa A12 
cells that express the TetR-KRAB fusion protein. HeLa A12 
cells were cotransfected with ptet07-CMV-L linearized at the 
unique Asp 700 site and pNeo 5' (26). One clone resistant to 
G418 that showed significant luciferase levels when grown in 
the presence of 0.5 jjLg of tetracycline per ml was selected and 
named TIS-10. Luciferase activities were determined in ex- 
tracts from TIS-10 cells after growth in medium containing or 
lacking tetracycline. The presence of tetracycline for 24 h in 
the growth medium of TIS-10 cells resulted in more than 
10- fold increases in luciferase activity (Fig. 5). No obvious 
change in growth behavior was detected in the presence of 
tetracycline, which demonstrates that the TetR-KRAB fusion 
protein can locate the tetO sequences in the chromatin of 
mammalian cells and reduce the activity of the ptet07-CMV-L 
construct within the genomic environment in a reversible fash- 
ion. A tetracycline concentration of 0.1 u,g/mJ was sufficient to 
release repression (data not shown). This concentration is well 
below the concentration of 10 p.g/ml above which inhibition of 
growth had been observed (reference 16 and data not shown). 

Kinetics of tetracycline action. The time course of tet racy- 
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FIG. 5. Tetracycline-inducible silencing of a chromosomaUy integrated CM V 
promoter fused to letO sequences. The ptet07-CMV-L construct was stably 
integrated into HeLa A12 cells, resulting in the TIS-lOcell line. TIS-10 cells were 
grown in 35-mm-diameter dishes (70% confluence), and tetracycline (tet; 0.5 
jig/ml) was either omitted or added to the growth medium. After 24 h, luciferase 
activities were determined from 1/50 of extracts from cells grown with or without 
tetracycline. Data from three individual dishes (TIS-10-1 to TIS-10-3) are shown. 



dine action on luciferase expression in TIS-10 cells was ana- 
lyzed in two different ways. First, tetracycline was added to 
cultures grown in the absence of tetracycline (Fig. 6). By 24 h, 
the luciferase activity was stimulated 10-fold over the nonin- 
duced values. This stimulation had increased to more than 
30-fold by 2 days and more than 50-fold after 3 days. In a 
second experiment, we measured the decrease of luciferase 
activities after removal of tetracycline. Within 3 days, these 
levels dropped more than 50-fold (data not shown). 

The hybrid protein (TetR-KRAB) produced in HeLa A12 
cells is associated with a cellular protein of around 110 kDa. 
To investigate the mechanism of TetR-KRAB-mediated re- 
pression, we performed electrophoretic mobility shift analysis 
with labeled synthetic tetO DNA and crude extracts from A12 
cells stably producing the TetR-KRAB protein. Incubation of 
the labeled tetO sequences with crude extract from A 12 cells 
led to the formation of a very slowly migrating specific complex 
CI (Fig. 7 A, lane 3). The migration of this complex cannot be 
explained simply by a dimer of the 38-kDa TetR-KRAB pro- 
tein bound to a 21-bp teiO DNA fragment. In several experi- 
ments, a significant fraction of the specific complexes did not 
even enter the gel, suggesting that large complexes might have 
been formed. Such complexes were not observed with extracts 
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FIG. 6. Time course of tetracycline action in TIS-10 cells. TIS-10 cells were 
grown in the absence of tetracycline (tet), trypsinxzed, seeded in 35-mm-diameter 
dishes at low density (15% confluency), and further grown without or with 0.5 mg 
of tetracycline per ml in the growth medium. After the indicated times, relative 
luciferase activities were determined from ahquots of extracts (1/50) of cells. 



from parental HeLa cells (Fig. 7 A, lane 2). Complex formation 
was sensitive to tetracycline (1 u>g/ml) and specifically com- 
peted for by an excess of unlabeled tetO DNA, while DNA 
fragments with unrelated sequences had no effect (data not 
shown). 

To investigate further the nature of these large complexes, 
the ionic detergent DOC was added to the binding reactions in 
order to dissociate weak protein-protein interactions (2). The 
presence of 0.05 to 0.2% DOC resulted in the disappearance of 
the large complex CI and the formation of a faster-migrating 
complex, C2 (Fig. 7 A, lanes 4 to 6). The nature of complex C2 
was further investigated by adding a Koxl-specific immune 
serum (28a). In the presence of a 1:200 dilution of the poly- 
clonal anti-Koxl immune serum together with 0.1% DOC, a 
retarded complex, C3, was formed (supershift; Fig. 7B, lane 5). 
No supershift was observed with the preimmune serum (Fig. 
7B, lane 4), which suggested that the faster-migrating complex 
C2 was probably composed of a TetR-KRAB dimer bound to 
the tetO sequence. The large complex CI was also detected in 
extracts from HeLa cells transiently transfected with pCMV- 
tetR-KRAB (CI; Fig. 7C, lane 3). In contrast, a much smaller 
complex, C4, was formed with extracts from HeLa cells trans- 
fected with pCMV-tetR expressing the 25-kDa TetR protein 
fused to the nuclear localization sequence but lacking the 
KRAB domain (C4: Fig. 7C, lane 4; see also Fig. 1 and 2). 
Complex C4, migrating just slightly above a nonspecific com- 
plex (asterisk in Fig. 7C), is not formed in the presence of 1 \l% 
of tetracycline per ml and is insensitive to low levels of DOC 
(not shown). 

The very slowly migrating complex CI could be the result 
either of TetR-KRAB dimers associating with each other to 
form a larger complex or of a complex in which the KRAB 
domain of Koxl is associated with a cellular factors). As a first 
step toward distinguishing between these possibilities, we met- 
abolically labeled HeLa A12 cells and HeLa cells with 
L-[ 35 S]methionine and prepared extracts. Extracts were immu- 
noprecipitated with the rabbit anti-Koxl immune serum and 
protein G-Sepharose beads and separated by SDS-PAGE (Fig. 
8). Immunoprecipitates of extracts from HeLa A12 cells 
showed two bands corresponding to the predicted size of the 
TetR-KRAB fusion protein of 38 kDa (Fig. 8, lane 2). This 
doublet was specifically missing in extracts from HeLa cells 
(Fig. 8, lane 5). The occurrence of a doublet may indicate 
posttranslational modification, such as phosphorylation, of the 
TetR-KRAB fusion protein synthesized in HeLa cells. In ad- 
dition, a band corresponding to a protein of around 110 kDa 
was specifically coimmunoprecipitated together with TetR- 
KRAB (Fig. 8, SMP1, lane 2). This band was not visible in 
extracts from HeLa cells (Fig. 8, lane 5). Most significantly, if 
the protein G-Sepharose beads were extracted once with 50 u,I 
of IP buffer containing 0.1% DOC, the 110-kDa protein was 
released from the beads (Fig. 8, lane 3) and appeared in the 
DOC supernatant (Fig. 8, lane 4). The DOC sensitivity of the 
association of the 110-kDa protein to TetR-KRAB in the co- 
immunoprecipitation is strikingly reminiscent of the DOC sen- 
sitivity of the large CI complexes formed in extracts from 
HeLa A12 cells with labeled tetO sequences in the electro- 
phoretic mobility shift analysis (Fig. 7 A, lanes 3 to 5). In ad- 
dition, the large complexes are not formed with the TetR 
protein lacking the KRAB domain. We therefore propose that 
a protein of around 110-kDa interacts with the KRAB domain 
of the TetR-KRAB fusion protein. This protein might be in- 
volved in the mechanism of repression of promoters from a 
distance by TetR-KRAB. We have tentatively named this pro- 
tein SMP1. 
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FIG. 7. DNA binding activity of TetR-KRAB from HeLa A12 cells. (A) Crude extracts from HeLa cells (lane 2) or HeLa A12 cells (lanes 3 to 8) were prepared 
and incubated with i2 P-end-labeled tetO DNA for 10 mio before DOC was added to the indicated final concentration. After further incubation for 5 min, the samples 
were subjected to electrophoresis in a 4% poryacrylamide gel. As a control, no extract was added (lane 1). The positions of the free tetO DNA and complexes C*, CI, 
and C2 are indicated. (B) Crude extracts from HeLa (lane 1) or HeLa A12 (lanes 2 to 5) cells were prepared and incubated with 32 P-end- labeled tetO DNA for 10 min 
at room temperature before DOC was added to 0.1% (final concentration) where indicated (lanes 3 to 5). No serum (lane 3) or either a preimmune serum (P) or an 
anti-Koxl immune serum ([) was added afterwards to 1% (final concentration) (lanes 4 and 5), and the samples were subjected to electrophoresis in a 4% 
poryacrylamide gel. The positions of the free tetO DNA and the different complexes (C*. CI, C2, and C3) that were formed are indicated. (C) Crude extracts from HeLa 
cells, transiently transfected with pCMV (lane 2), pCMV-tetR-KRAB (lane 3), or pCMV-tetR (lane 4), were prepared, and aliquots were incubated with ^P-end- 
labeled tetO DNA for 15 min at room temperature and subjected to electrophoresis in a 5% polyacryl amide gel. The positions of the free tetO DNA, the specific 
complexes CI and C4, and a nonspecific complex (*) are indicated. 



DISCUSSION 

In recent years, domains of various proteins that, when fused 
to DNA binding domains, can actively repress the activity of 
promoters containing appropriate target sequences have been 
described (reviewed in references 10 and 11). Among these is 
the KRAB domain of Koxl, which functions very efficiently 
(13a, 27). The KRAB domain is an evolutionarily conserved 
domain of 75 amino acids comprising a heptad repeat of me- 
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FIG. 8. A protein of around 110 kDa is coimmunoprecipitated with the 
TetR-KRAB from HeLa A12 cells. Whole cellular extracts of HeLa (lanes 5 to 
7) and HeLa A12 (lanes 2 to 4) cells, labeled with L-| 35 SJmethionine, were 
incubated with an anti-Koxl immune serum, and the immune complexes were 
collected with protein G-agarose beads (B; lanes 2 and 5). Half of the beads were 
incubated with a buffer containing 0.1% DOC, and the supernatants (S, DOC; 
lanes 4 and 7) were separated from the beads (B, DOC; lanes 3 and 6). The 
proteins on the beads and in the supernatants were analyzed by SDS-PAGB 
(12% gel) and fluorography. The positions of the 38-kDa TetR-KRAB fusion 
protein and a 110- to 120-kDa protein named SMP1 are indicated. The sizes of 
"C-labeled protein standards (lane 1) are indicated on the left 



thionine and leucine residues in Koxl (35). It has been esti- 
mated that approximately one-third of all zinc finger-type 
DNA-binding proteins contain a KRAB domain NH 2 terminal 
to the zinc finger region (3). We show here that a fusion of the 
KRAB repression domain derived from the human Koxl pro- 
tein to the Tnitf-derived E. coli TetR generated the TetR- 
KRAB hybrid protein, which functions as a potent DNA-bind- 
ing-site-dependent transrepressor of transcription. The TetR- 
KRAB protein was overproduced in HeLa cells to significant 
levels and seems to be localized to the nucleus. The nuclear 
localization is presumably favored because of the presence of a 
nuclear localization signal derived from the SV40 large T an- 
tigen (20) between TetR and KRAB domains (Fig. 1). The 
strong nuclear staining in HeLa A12 cells (Fig. 3) is accompa- 
nied by the presence of a strong DNA binding activity toward 
tetO sequences in crude extracts (Fig. 7A). Furthermore, the 
anti-Koxl immune serum detects the native KRAB domain in 
the TetR-KRAB protein, as demonstrated by the supershift of 
the TetR-KRAB/ferO complex (C3; Fig. 7B, lane 5). The TetR- 
KRAB produced in HeLa cells binds to tetO sequences in vitro, 
and this association is prevented by tetracycline (not shown). 
When the TetR-KRAB protein is bound to tetO sequences 
upstream of the enhancer and the transcriptional initiation site 
of promoters like the immediate-early human CMV promoter 
or the HSV TK promoter, the productive transcriptional ac- 
tivity (determined by using the luciferase gene as a reporter 
gene) of the respective constructs is repressed in transient 
assays. This repression is released by tetracycline and strictly 
dependent on both the presence of the KRAB domain (Fig. 2) 
and cis-acting telO sequences (Fig. 4a and b). Repression by 
the TetR-KRAB protein is observable when tetO sequences 
are placed more than 3 kb upstream or downstream of the 
transcriptional initiation site of a eukaryotic promoter and 
independent of the orientation of the tetO sequences, suggest- 
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FIG. 9. Model of the silencing mechanism by TetR-KRAB and SMP1. An 
enhancer-binding protein exerts its stimulatory effect (-f) on the basal transcrip- 
tional machinery through an adaptor protein. The TetR-KRAB protein binds to 
the tetO sequences, which can be regarded as a as -acting silencer. The silencing 
effect (-) of the TetR-KRAB fusion protein bound to its tetO sequences is 
mediated by the SMP1 protein and could act on different levels of transcriptional 
control as indicated. 



ing that the TetR-KRAB protein functions as a tetracycline 
reversible silencing protein (i3a). Tetracycline-reversible si- 
lencing is also observed with TIS-10 cells that carry the ptet07- 
CMV-L construct stably integrated into the genome of HeLa 
A12 cells expressing the TetR-KRAB protein (Fig. 5). Here 
the addition of tetracycline to the medium of TIS-10 cells 
results in a more than 50- fold activation of productive tran- 
scription from the CMV promoter (Fig. 6). Removal of tetra- 
cycline from the medium of cells grown in the presence of 
tetracycline restores repression (not shown). The kinetics of 
this process is rather slow, with half-maximal effects by 2 days. 
Induction or repression is observed in rapidly dividing and 
nondividing, i.e., confluent, cells, suggesting that replication of 
the DNA may not be essential for the underlying mechanism 
(13a). 

Many other proteins contain repression domains that func- 
tion when fused to heterologous DNA binding domains (8, 9, 
17, 21, 24, 25, 27, 34, 37, 39). The mechanisms by which such 
alanine-rich or proline-rich domains, as well as the KRAB 
domains, exert their repressing functions have not yet been 
identified (10, 11, 18). We show here that the TetR-KRAB 
protein produced in HeLa cells forms very large complexes 
with tetO sequences in vitro (CI; Fig. 7A, lane 3). Low con- 
centrations of DOC disrupt the large complexes and result in 
the formation of smaller complexes, presumably composed of 
TetR-KRAB dimers (C2; Fig. 7A, lanes 4 to 6). The TetR 
lacking the KRAB domain forms even smaller complexes (C4; 
Fig. 7C, lane 4). This finding suggests that at least one factor 
may be associated with TetR-KRAB and released upon treat- 
ment with DOC The immunoprecipitations from extracts of 
HeLa A12 cells obtained by using the anti-Koxl immune se- 
rum show that a protein of around 110 kDa is associated with 
the TetR-KRAB protein (Fig. 8). This protein dissociates from 
TetR-KRAB in the presence of DOC in a manner strikingly 
reminiscent of the DOC sensitivity of the large TetO/TetR- 
KRAB complex CI formed with extracts from HeLa A12 cells 
(Fig. 7A and 8). The data suggest that the 110-kDa protein is 
part of the large complex CI (Fig. 7A, lane 3). We tentatively 
named this protein SMP1 and propose that it is involved in 
mediating the silencing function of TetR-KRAB as outlined in 
Fig. 9. Transcriptional activator proteins bind to sites in the 
CMV enhancer and activate transcription from the basal tran- 
scriptional machinery in part by interacting with adaptor mol- 



ecules that mediate the transactivation function; the function 
of SMP1 could be to mediate the silencing effect of the TetR- 
KRAB protein (Fig. 9). Further work will have to show 
whether the silencing exerted by the TetR-KRAB protein im- 
pinges directly or indirectly (i.e., via SMP1) on factors of the 
basal transcriptional machinery or if it works by influencing the 
chromatin structure of the target gene (Fig. 9). It is tempting to 
speculate that the KRAB-SMP1 interaction may share some 
similarity to protein-protein interactions that are involved in 
establishing repression or silencing in Saccharomyces cerevisiae 
as is the case for Ssn6-Tupl and SIR1 (9, 21, 37). 

Apart from the opportunity to understand the phenomenon 
of transcriptional repression exerted by the KRAB domain of 
Koxl, our data provide a basis for a novel way to regulate gene 
expression in higher mammalian cells. In contrast to a previ- 
ously published system for highly efficient regulation of gene 
expression using a tTA-responsive promoter (15, 16), the tet- 
racycline-reversible silencing of complex promoters by the 
TetR-KRAB protein offers the unique possibility of reversibly 
down-regulating the expression of cellular genes on top of their 
normal cellular regulation. Here, tetO sequences would be 
placed at a distance (0.1 to 3 kb) upstream or downstream from 
the transcriptional start site of the target gene such that the 
normal regulation is not disturbed. Binding of TetR-KRAB 
expressed in the very same cells results in transcriptional re- 
pression that can be released by adding tetracycline to the 
growth medium. Understanding the mechanism of the silenc- 
ing exerted by the KRAB domain might allow improvement of 
the stringency of the system in order to render it applicable for 
the generation of cell lines or whole animals in which the 
expression of particular genes can be down-regulated in a 
reversible fashion. 
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Description 

Background of the Invention 

[0001 ] The invention relates to methods for identifying 
nucleic acid sequences that modulate the function of a 
cell, by the use of posMranscriptional gene silencing. 
[0002] Double stranded RNA (dsRNA) has been 
shown to induce sequence-specific gene silencing in a 
number of different organisms. Gene silencing can oc- 
cur through various mechanisms, one of which is post- 
transcriptional gene silencing (PTGS). In post -transcrip- 
tional gene silencing, transcription of the target locus is 
not affected, but the RNA half-lrfe is decreased. The 
mechanisms by which PTGS occurs are not yet clear. 
Exogenous dsRNA has been shown to act as a potent 
inducer of PTGS in nematodes, trypanosomes, and in- 
sects. In addition, studies in C. eiegans and Drosophiia 
show that a few molecules of dsRNA per cell are suffi- 
cient to trigger a PTGS response. Furthermore, studies 
in mice have demonstrated that dsRNA can interfere 
with the expression of genes in mouse embryos. 
[0003] There exists a need to identify molecules that 
selectively regulate the expression of genes in verte- 
brate cells without the associated toxicity of the interfer- 
on response. Such regulation should allow the down- 
regulation of expression from genes whose gene prod- 
ucts are detrimental to the cells. 

Summary of the Invention 

[0004] In general the invention features high through- 
put methods of using PTGS to identify a nucleic acid 
sequence that modulates the function of a cell, gene ex- 
pression of a target nucleic acid, or the biological activity 
of a target polypeptide. The method involves the use of 
specially constructed cDN A libraries derived from a cell, 
for example, a primary cell or a cell line that has an ob- 
servable phenotype or biological activity, {e.g., an activ- 
ity mediated by a target polypeptide or altered gene ex- 
pression), that are transfected into cells to inhibit gene 
expression. This inhibition of gene expression alters the 
function of a cell., gene expression of a target nucleic 
acid, or the biological activity of a target polypeptide, 
and the nucleic acid sequence responsible for the mod- 
ulation can be readily Identified. The method may also 
utilize randomized nucleic acid sequences or a given se- 
quence for which the function is not known. Although 
the use of PTGS as a validation strategy is known in the 
art, its use in screening techniques, as described herein, 
is novel. 

[0005] Accordingly, in a first aspect, the invention fea- 
tures a method for identifying a nucleic acid sequence 
that modulates the function of a cell. The method in- 
volves: (a) transforming a population of cells with a dou- 
ble stranded RNA expression library, where the library 
is derived from the cells, where at least two cells of the 
population of cells are each transformed with a different 



nucleic acid from the double stranded RNA expression 
library, and where the nucleic acid is capable of forming 
double stranded RNA; (b)optionally selecting for a cell 
in which the nucleic acid is expressed in the cell; and (c) 
5 assaying for a modulation in the function of the cell, 
wherein a modulation identifies a nucleic acid sequence 
that modulates the function of a cell. 
[0006] In a desirable embodiment of the first aspect 
of the invention, assaying for a modulation in the func- 
10 tion of a cell comprises measuring cell motility, apopto- 
sis, cell growth, cell invasion, vascularization, cell cycle 
events, cell differentiation, cell dedifferentiation, neuro- 
nal cell regeneration, or the ability of a cell to support 
viral replication. 

15 [0007] In a second aspect, the invention features a 
method for identifying a nucleic acid sequence that mod- 
ulates expression of a target nucleic acid in a cell. The 
method involves: (a) transforming a population of cells 
with a double stranded RNA expression library, where 

20 the library is derived from the cells, where at least two 
- cells of the population of cells are each transformed with 
a different nucleic acid from the double stranded RNA 
expression library, and where the nucleic acid is capable 
of forming double stranded RNA; (b)optionally selecting 

25 for a cell in which the nucleic acid is expressed in the 
cell; and (c) assaying for a modulation in the expression 
of a gene in the cell, where a modulation identifies a 
nucleic acid sequence that modulates expression of a 
target nucleic acid in a cell. 

30 [0008] In a desirable embodiment of the second as- 
pect of the invention, the target nucleic acid is assayed 
using DNA array technology. 

[0009] In a third aspect, the invention features a meth- 
od for identifying a nucleic acid sequence that modu- 

35 lates the biological activity of a target polypeptide in a 
cell. The method involves: (a) transforming a population 
of cells with a double stranded RNA expression library, 
where the library is derived from the cells, where at least 
two cells of the population of cells are each transformed 

^0 with a different nucleic acid from the double stranded 
RNA expression library, and where the nucleic acid is 
capable of forming double stranded RNA; (b)optionally 
selecting for a cell in which the nucleic acid is expressed 
in the cell; and (c) assaying for a modulation in the bio- 

45 logical activity of a target polypeptide in the cell , wherein 
a modulation identifies a nucleic acid sequence that 
modulates the biological activity of a target polypeptide. 
[0010] In one embodiment of any of the above as- 
pects of the invention, in transforming step (a), the nu- 

50 cleic acid is stably integrated into a chromosome of the 
cell. Integration of the nucleic acid may be random or 
site-specific. Desirably integration is mediated by re- 
combination or retroviral insertion. In addition, desirably 
a single copy of the nucleic acid is integrated into the 

55 chromosome. In another embodiment of any of the 
above aspects of the invention, in step (a) at least 50, 
more desirably 100; 500; 1000; 10,000; or 50,000 cells 
of the population of cells are each transformed with a 
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different nucleic acid from the double stranded RNA ex- 
pression library. In other embodiments, the population 
of cells is transformed with at least 5% : more desirably 
at least 25%, 50%, 75%, or 90%, and most desirably at 
least 95% of the double stranded RNA expression ii- s 
brary In yet another embodiment, the method further 
involves: (d) identifying the nucleic acid sequence by 
amplifying and cloning the sequence. Desirably ampli- 
fication of the sequence involves the use of the polymer- 
ase chain reaction (PCR). 10 
[001 1 ] In other embodiments of any of the above as- 
pects of the invention, the double stranded RNA expres- 
sion library contains cDNAs or randomized nucleic ac- 
ids. The double stranded RNA expression library may 
be a nuclear double stranded RNA expression library, 15 
in which case the double stranded nucleic acid is made 
in the nucleus. Alternatively, the double stranded RNA 
expression library may be a cytoplasmic double strand- 
ed RNA expression library, in which case the double 
stranded nucleic acid is made in the cytoplasm. In ad- 20 
dition, the nucleic acid from the double stranded RNA 
expression library may be made in vitro or in vivo. In 
addition, the identified nucleic acid sequence may be 
located in the cytoplasm of the cell. 

In still another embodiment of any of the above aspects 25 
of the invention, the nucleic acid is contained in a vector, 
for example a double stranded RNA expression vector. 
The vector may then be transformed such that it is stably 
integrated into a chromosome of the cell, or it may func- 
tion as an episomal (non-integrated) expression vector 30 
within the cell. In one embodiment, a vector that is inte- 
grated into a chromosome of the cell contains a promot- 
er operably linked to a nucleic acid encoding ahairpin or 
double stranded RNA. In another embodiment, the vec- 
tor does not contain a promoter operably linked to a nu- 35 
cleic acid encoding a double stranded RNA. In thislater 
embodiment, the vector integrates into a chromosome 
of a cell such that an endogenous promoter is operably 
linked to a nucleic acid from the vector that encodes a 
double stranded RNA. Desirably, the double stranded 40 
RNA expression vector comprises at least one RNA 
polymerase II promoter, for example, a human CMV-im- 
mediate early promoter (HCMV-IE) or a simian CMV 
(SCMV) promoter, at least one RNA polymerase I pro* 
moter, or at least one RNA polymerase III promoter. The 45 
promoter may also be a T7 promoter, in which case, the 
cell further comprises T7 polymerase. Alternatively, the 
promoter may be an SP6 promoter, in which case, the 
cell further comprises SP6 polymerase. The promoter 
may also be one convergent T7 promoter and one con- so 
vergent SP6 promoter. A cell may be made to contain 
T7 or SP6 polymerase by transforming the cell with a 
T7 polymerase or an SP6 polymerase expression plas- 
mid, respectively. In some embodiments, a T7 promoter 
or a RNA polymerase III promoter is operably linked to 55 
a nucleic acid that encodes a small double stranded 
RNA (e.g., a double stranded RNA that is less than 200, 
150. 100. 75, 50, or 25 nucleotides in length). In other 



embodiments, the promoter is a mitochondrial promoter 
that allows cytoplasmic transcription of the nucleic acid 
in the vector (see, for example, the mitochondrial pro- 
moters described in WO 00/63364, filed April 1 9, 2000). 
Alternatively, the promoter is an inducible promoter, 
such as a lac (Cronin et at. Genes & Development 15: 
1506-1517, 2001), ara (Khlebnikov et al., J Bacteriol. 
2000 Dec;182(24):7029-34), ecdysone (Rheogene, 
www.rheogene.com), RU48 (mefepristone) (corticos- 
teroid antagonist) (Wang XJ, Liefer KM, Tsai S, O'Malley 
BW, Roop DR, Proc Natl Acad Sci U S A. 1 999 Jul 20; 
96(15):8483-8), or tet promoter (Rendal et al., Hum 
Gene Ther. 2002 Jan;13(2):335-42. and Larnartina et 
a/., Hum Gene Ther. 2002 Jan; 13(2): 199-210) or a pro- 
moter disclosed in WO 00/63364, filed April 19, 2000. 
In desirable embodiments, the inducible promoter is not 
induced until all the episomal vectors are eliminated 
from the cell. The vector may also comprise a selectable 
marker. In addition, these vectors may be used in com- 
bination with methods that inhibit or prevent an interfer- 
on response or double stranded RNA stress response, 
as described herein. 

[0012] Desirably in a vector for use in any of the above 
aspects of the invention, the sense strand and the anti- 
sense strand of the nucleic acid sequence are tran- 
scribed from the same nucleic acid sequence using two 
convergent promoters. In another desirable embodi- 
ment, in a vector for use in any of the above aspects of 
the invention, the nucleic acid sequence comprises an 
inverted repeat, such that upon transcription, the nucleic 
acid forms a double stranded RNA. 
[0013] In still other embodiments of any of the above 
aspects of the invention, the cell and the vector each 
further comprise a toxP site and site-specific integration 
of the nucleic acid into a chromosome of the cell occurs 
through recombination between the toxP sites. In addi- 
tion, step (b) of any of the above aspects of the invention 
further involves rescuing the nucleic acid through Cre- 
mediated double recombination. 
[0014] In still further embodiments of any of the above 
aspects of the invention, the identified nucleic acid se- 
quence is located in the nucleus of the cell. Alternatively, 
the identified nucleic acid sequence may be located in 
the cytoplasm of the cell. 

[0015] In yet another embodiment of any of the above 
aspects of the invention, the nucleic acid from the dou- 
ble stranded RNA expression library is at least 1 00, 500, 
600, or 1000 nucleotides in length. In other embodi- 
ments of any of the above aspects of the invention, the 
nucleic acid from the double stranded RNA expression 
library is at least 10, 20, 30, 40, 50, 60, 70, 80, or 90 
nucleotides in length. In yet other embodiments, the 
number of nucleotides in the nucleic acid from the dou- 
ble stranded RNA expression library is between 5-100 
nucleotides, 15-100 nucleotides, 20-95 nucleotides, 
25-90 nucleotides, 35-85 nucleotides, 45-80 nucle- 
otides, 50-75 nucleotides, or 55-70 nucleotides, inclu- 
sive. In still other embodiments, the number of nucle- 
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otides in the nucleic acid from the double stranded RNA 
expression library is contained in one of the following 
ranges: 5-15 nucleotides, 15-20 nucleotides, 20-25 nu- 
cleotides, 25-35 nucleotides, 35-45 nucleotides, 45-60 
nucleotides, 60-70 nucleotides, 70-80 nucleotides, 
80-90 nucleotides, or 90-100 nucleotides, inclusive. In 
other embodiments, the nucleic acid contains less than 
50,000; 1 0,000; 5,000; or 2,000 nucleotides. In addition, 
the nucleic acid from the double stranded RNA expres- 
sion library may contain a sequence that is less than a 
full length RNA sequence. 

[0016] In still further embodiments of any of the above 
aspects of the invention, the cell is a plant cell or an an- 
imal cell. Desirably the animal cell is a vertebrate or 
mammalian cell, for example, a human cell. The cell 
may be ex vivo or in vivo. The cell may be a gamete or 
a somatic cell, for example, a cancer cell, a stem cell, a 
cell of the immune system, a neuronal cell, a muscle 
cell, or an adipocyte. 

[0017] Transformation/transfection of the cell may oc- 
cur through a variety of means including, but not limited 
to, lipofection, DEAE-dextran-mediated transfection, 
microinjection, protoplast fusion, calcium phosphate 
precipitation, viral or retroviral delivery, electroporation, 
or biolistic transformation. The RNA or RNA expression 
vector (DNA) may be naked RNA or DN A or local anes- 
thetic complexed RNA or DNA (Pachuk era/., Biochim. 
Biophys. Acta 1468:20-30, 2000). In another embodi- 
ment, the cell is not a C. elegans cell. Desirably the ver- 
tebrate or mammalian cell has been cultured for only a 
small number of passages {e.g., less than 30 passages 
of a cell line that has been directly obtained from Amer- 
ican Type Culture Collection), or are primary cells. De- 
sirably, the vertebrate or mammalian cell is transformed 
with nucleic acids that are not complexed with cationic 
lipids. 

[0018] In yet another embodiment of any of the above 
aspects of the invention, the cell is derived from a parent 
cell, and is generated by: (a) transforming a population 
of parent cells with a bicistronic plasm id expressing a 
selectable marker and a reporter gene, and comprising 
a toxPsite; (b) selecting for a cell in which the plasmid 
is stably integrated; and (c) selecting for a cell in which 
one copy of the plasmid is stably integrated in a tran- 
scriptionally active locus. Desirably the selectable mark- 
er Is G41 8 and the reporter gene Is green fluorescent 
protein (GFP). 

[0019] In still another embodiment of the above as- 
pects of the Invention, generation of the double stranded 
expression library comprises: (a) isolating RNA from a 
cell; (b) synthesizing cDNAs from the RNA of step (a); 
and (c) cloning each cDNA into a vector. Desirably cD- 
NA synthesis is optimized and/or size selected for the 
generation and/or selection of cDNAs that are at least 
100, 500, 600, or 1000 nucleotides in length. In other 
embodiments, the cDNAs are least 10, 20, 30, 40, 50, 
60, 70, 80, or 90 nucleotides in length. In yet other em- 
bodiments, the number of nucleotides in the cDNAs is 



between 5-100 nucleotides, 15-100 nucleotides, 20-95 
nucleotides, 25-90 nucleotides. 35-85 nucleotides, 
45-80 nucleotides, 50-75 nucleotides, or 55-70 nucle- 
otides, inclusive. In still other embodiments, the number 
5 of nucleotides in the cDNAs is contained in one of the 
following ranges: 5-15 nucleotides, 15-20 nucleotides, 
20-25 nucleotides, 25-35 nucleotides, 35-45 nucle- 
otides, 45-60 nucleotides, 60-70 nucleotides, 70-80 nu- 
cleotides, 80-90 nucleotides, or 90-100 nucleotides, in- 
fo elusive. In other embodiments, the cDNAs contain less 
than 50,000; 10,000; 5,000; or 2,000 nucleotides. In ad- 
dition, the cDNA may encode an RNA fragment that is 
less than full length. Desirably the vector comprises two 
convergent T7 promoters, two convergent SP6 promot- 
es ers, or one convergent T7 promoter and one convergent 
SP6 promoter, a selectable marker, and/or a foxPsite. 
[0020] In an additional embodiment of any of the 
above aspects of the invention, the method is carried 
out under conditions that inhibit or prevent an interferon 
20 response or double stranded RNA stress response. 
[0021] In a fourth aspect, the invention features a 
method for identifying a nucleic acid sequence that mod- 
ulates the function of a cell, involving: (a) transforming 
a population of cells with a double stranded RNA that is 
25 derived from the cells; (b)optionally selecting for a cell 
in which the nucleic acid is expressed; and (c) assaying 
for a modulation in the function of the ceil, wherein the 
modulation identifies a nucleic acid sequence that mod- 
ulates the function of a cell, wherein the method is de- 
30 sirably carried out under conditions that inhibit or pre- 
vent an interferon response or double stranded RNA 
stress response. 

[0022] In a desirable embodiment of the fourth aspect 
of the invention, assaying for a modulation in the func- 

35 tion of a cell comprises measuring cell motility, apopto- 
sis, cell growth, cell invasion, vascularization, cell cycle 
events, cell differentiation, cell dedifferentiation, neuro- 
nal cell regeneration, or the ability of a cell to support 
viral replication. 

40 [0023] In a fifth aspect, the invention features a meth- 
od for identifying a nucleic acid sequence that modu- 
lates expression of a target nucleic acid in a cell, involv- 
ing: (a) transforming a population of cells with a double 
stranded RNA that is derived from the cells; (b)optionally 

45 selecting for a cell In which the nucleic acid is expressed; 
and (c) assaying for a modulation in the expression of 
the gene in the cell, wherein the modulation identifies a 
nucleic acid sequence that modulates expression of a 
target nucleic acid in a cell, wherein the method is de- 

50 sirably carried out under conditions that inhibit or pre- 
vent an interferon response or double stranded RNA 
stress response. 

[0024] In a desirable embodiment of the fifth aspect 
of the invention, the target nucleic add is assayed using 
55 DNA array technology. 

[0025] In a sixth aspect, the invention features a meth- 
od for identifying a nucleic acid sequence that modu- 
lates the biological activity of a target polypeptide in a 
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cell, involving: (a) transforming a population of cells with 
a double stranded RNA that is derived from the cells; 
(b)optionally selecting for a cell in which the nucleic acid 
is expressed in the cell; and (c) assaying for a modula- 
tion in the biological activity of a target polypeptide in 
the cell, wherein the modulation identifies a nucleic acid 
sequence that modulates the biological activity of a tar- 
get polypeptide in a cell, wherein the method is desirably 
carried out under conditions that inhibit or prevent an 
interferon response or double stranded RNA stress re- 
sponse. 

[0026] In a seventh aspect, the invention features a 
method for identifying a nucleic acid sequence that mod- 
ulates the function of a cell, involving: (a) transforming 
a population of cells with a double stranded RNA; (b) 
optionally selecting for a cell in which the nucleic acid is 
expressed; and (c) assaying for a modulation in the 
function of the cell. Desirably, the modulation identifies 
a nucleic acid sequence that modulates the function of 
a ceil, wherein the method is desirably carried out under 
conditions that or prevent an interferon response or dou- 
ble stranded RNA stress response. 
[0027] In a desirable embodiment of the seventh as- 
pect of the invention, assaying for a modulation in the 
function of a cell comprises measuring cell motility, ap- 
optosis, cell growth, cell invasion, vascularization, cell 
cycle events, cell differentiation, cell dedifferentiation, 
neuronal cell regeneration, or the ability of a cell to sup- 
port viral replication. 

[0028] In a eighth aspect, the invention features a 
method for identifying a nucleic acid sequence that mod- 
ulates expression of a target nucleic acid in a cell, in- 
volving: (a) transforming a population of cells with a dou- 
ble stranded RNA; (b)optionally selecting for a cell in 
which the nucleic acid is expressed; and (c) assaying 
for a modulation in the expression of the gene in the cell, 
wherein the modulation identifies a nucleic acid se- 
quence that modulates expression of a target nucleic 
acid in a cell. Desirably, the method is carried out under 
conditions that inhibit or prevent an interferon response 
or double stranded RNA stress response. 
[0029] In a desirable embodiment of the eighth aspect 
of the invention, the target nucleic acid is assayed using 
DNA array technology. 

[0030] In a ninth aspect, the invention features a 
method for identifying a nucleic acid sequence that mod- 
ulates the biological activity of a target polypeptide in a 
cell, involving: (a) transforming a population of cells with 
a double stranded RNA; (b)optionally selecting for a cell 
in which the nucleic acid is expressed in the cell; and (c) 
assaying for a modulation in the biological activity of a 
target polypeptide in the cell, wherein the modulation 
identifies a nucleic acid sequence that modulates the 
biological activity of a target polypeptide in a cell. Desir- 
ably, the method is carried out under conditions that in- 
hibit or prevent an interferon response double stranded 
RNA stress response. 

[0031] In one embodiment of any of the above as- 



pects of the invention, in step (a) at least 2, more desir- 
ably 50; 100; 500; 1000; 10,000; or 50,000 cells of the 
population of cells are each transformed with a different 
double stranded RNA from a double stranded RNA ex- 
5 pression library. Desirably, at most one double stranded 
RNA is inserted into each cell. In other embodiments, 
the population of cells is transformed with at least 5%, 
more desirably at least 25%, 50%, 75%, or 90%, and 
most desirably, at least 95% of the double stranded RNA 
10 expression library. In still another embodiment of any of 
the fourth, fifth, or sixth aspects of the invention, the 
method further involves: (d) identifying the nucleic acid 
sequence by amplifying and cloning the sequence. De- 
sirably amplification of the sequence involves the use of 
*5 the polymerase chain reaction (PCR). 

[0032] In a tenth aspect, the invention features a cell 
or a population of cells that expresses a double stranded 
RNA that (i) modulates a function of the cell, (il) modu- 
lates the expression of a target nucleic acid {e.g., an en- 
20 dogenous or pathogen gene) in the cell, and/or (iii) mod- 
ulates the biological activity of a target protein (e.g., an 
endogenous or pathogen protein) in the cell. Desirably, 
the cell contains only one molecular species of double 
stranded RNA or only one copy of a double stranded 
25 RNA expression vector {e.g., a stably integrated vector). 
Desirably, the cell or population of cells is produced us- 
ing one or more methods of the invention. In other em- 
bodiments, the double stranded RNA is expressed un- 
der conditions that inhibit or prevent an interferon re- 
30 sponse or a double stranded RNA stress response 
[0033] In other embodiments of any of the fourth, fifth, 
sixth, seventh, eighth, ninth, or tenth aspects of the in- 
vention, the double stranded RNA is derived from cD- 
NAs or randomized nucleic acids. In addition, the double 
35 stranded RNA may be a cytoplasmic double stranded 
RNA, in which case the double stranded nucleic acid is 
made in the cytoplasm. The double stranded RNA may 
be made in vitro or in vivo. In addition, the identified nu- 
cleic acid sequence may be located in the cytoplasm of 
to the cell. 

[0034] In still another embodiment of any of the fourth , 
fifth, sixth, seventh, eighth, ninth, ortenth aspects of the 
invention, the nucleic acid is contained in a vector, for 
example, a double stranded RNA expression vector that 

45 is capable of forming a double stranded RNA. Desirably 
the double stranded RNA expression vector comprises 
at least one promoter. The promoter may be a T7 pro- 
moter, in which case, the cell further comprises T7 
polymerase. Alternatively, the promoter may be an SP6 

50 promoter, in which case, the cell further comprises SP6 
polymerase. The promoter may also be one convergent 
T7 promoter and one convergent SP6 promoter. A cell 
may be made to contain T7 or SP6 polymerase by trans- 
forming the cell with a T7 polymerase or an SP6 

55 polymerase expression plasmid, respectively. The vec- 
tor may also comprise a selectable marker, for example 
hygromycin. 

[0035] Desirably in a vector for use in any of the fourth, 
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fifth, sixth, seventh, eighth, ninth, or tenth aspects of the 
invention, the sense strand and the antisense strand of 
the nucleic acid sequence are transcribed from the 
same nucleic acid sequence using two convergent pro- 
moters. In another desirable embodiment, in a vector for 5 
use in any of the above aspects of the invention, the 
nucleic acid sequence comprises an inverted repeat, 
such that upon transcription, the nucleic acid forms a 
double stranded RNA. 

[0036] In yet another embodiment of any of the fourth, 10 
fifth, sixth, seventh, eighth, ninth, or tenth aspects of the 
invention, the double stranded RNA is at least 100, 500, 
600, or 1000 nucleotides in length. In other embodi- 
ments of any of the fourth, fifth, or sixth aspects of the 
invention, the double stranded RNA is at least 10, 20, '5 
30, 40, 50, 60, 70, 80, or 90 nucleotides in length. In yet 
other embodiments, the number of nucleotides in the 
double stranded RNA is between 5-100 nucleotides, 
15-100 nucleotides, 20-95 nucleotides, 25-90 nucle- 
otides, 35-85 nucleotides, 45-80 nucleotides, 50-75 nu- 20 
cleotides, or 55-70 nucleotides, inclusive. In still other 
embodiments, the number of nucleotides in the double 
stranded RNA is contained in one of the following rang- 
es: 5-15 nucleotides, 15-20 nucleotides, 20-25 nucle- 
otides, 25-35 nucleotides, 35-45 nucleotides, 45-60 nu- 25 
cleotides, 60-70 nucleotides, 70-80 nucleotides, 80-90 
nucleotides, or 90-100 nucleotides, inclusive. In other 
embodiments, the double stranded RNA contains less 
than 50,000; 10,000; 5,000; or 2,000 nucleotides. In ad- 
dition, the double stranded RNA may contain a se- 30 
quence that is less than a full length RNA sequence. 
[0037] In still further embodiments of any of the fourth, 
fifth, sixth, seventh, eighth, ninth, or tenth aspects of the 
invention, the cell is a plant cell or an animal cell. Desir- 
ably the animal cell is a vertebrate or mammalian cell, 35 
for example, a human cell. The cell may be ex vivo or in 
vivo. The cell may be a gamete or a somatic cell for 
example, a cancer cell, a stem cell, a cell of the immune 
system, a neuronal cell, a muscle cell, or an adipocyte. 
[0038] In other embodiments of any of the first, sec- 40 
ond, third, seventh, eighth, ninth, or tenth aspects of the 
invention, the double stranded RNA is derived from a 
cell or a population of cells and used to transform an- 
other cell population of either the same cell type or a 
different cell type. In desirable embodiments, the trans- 45 
formed cell population contains cells of a cell type that 
is related to the cell type of the ceils from which the dou- 
ble stranded RNA was derived (e.g., the transformation 
of cells of one neuronal cell type with the double strand- 
ed RNA derived from cells of another neuronal cell type), so 
In yet other embodiments of any of these aspects, the 
double stranded RNA contains one or more contiguous 
or non-contiguous positions that are randomized (e.g., 
by chemical or enzymatic synthesis using a mixture of 
nucleotides that may be added at the randomized posi- 55 
tion). In still other embodiments, the double stranded 
RNA is a randomized nucleic acid in which segments of 
ribonucleotides and/or deoxy ribonucleotides are ligated 



to form the double stranded RNA. 
[0039] In other embodiments of any of various as- 
pects of the invention, the double stranded RNA specif- 
ically hybridizes to a target nucleic acid but does not 
substantially hybridize to non-target molecules, which 
include other nucleic acids in the cell or biological sam- 
ple having a sequence that is less than 99, 95, 90, 80, 
or 70% identical or complementary to that of the target 
nucleic acid. Desirably, the amount of the these non-tar- 
get molecules hybridized to, or associated with, the dou- 
ble stranded RNA, as measured using standard assays, 
is 2-fold, desirably 5-fold, more desirably 10-fold, and 
most desirably 50-fold lower than the amount of the tar- 
get nucleic acid hybridized to, or associated with, the 
double stranded RNA. In other embodiments, the 
amount of a target nucleic acid hybridized to, or associ- 
ated with, the double stranded RNA, as measured using 
standard assays, is 2-fold, desirably 5-fold, more desir- 
ably 1 0-fold, and most desirably 50-fold greater than the 
amount of a control nucleic acid hybridized to, or asso- 
ciated with, the double stranded RNA. Desirably, the 
double stranded RNA only hybridizes to one target nu- 
cleic acid from a cell under denaturing, high stringency 
hybridization conditions. In certain embodiments, the 
double stranded RNA is substantially homologous (e.g., 
at least 80, 90, 95, 98,or 1 00% homologous) to only one 
target nucleic acid from a cell. In other embodiments, 
the double stranded RNA is homologous to multiple 
RNAs, such as RNAs from the same gene family. In yet 
other embodiments, the double stranded RNA is homol- 
ogous to distinctly different mRNA sequences from 
genes that are similarly regulated (e.g., developmental, 
chromatin remodeling, or stress response induced). In 
other embodiments, the double stranded RNA is homol- 
ogous to a large number of RNA molecules, such as a 
double stranded RNA designed to induce a stress re- 
sponse or apoptosis. In other embodiments, the percent 
decrease in the expression of a target nucleic acid is at 
least 2, 5, 10, 20, or 50 fold greater than the percent 
decrease in the expression of a non-target orcontrol nu- 
cleic acid. Desirably, the double stranded RNA inhibits 
the expression of a target nucleic acid but has negligible, 
if any, effect on the expression of other nucleic acids in 
the cell. Examples of control nucleic acids include nu- 
cleic acids with a random sequence or nucleic acids 
known to have little, if any, affinity for the double strand- 
ed RNA. 

[0040] In other embodiments of any of various as- 
pects of the invention, at most one molecular species of 
double stranded RNA is inserted into each cell. In other 
embodiments, at most one vector is stably integrated 
into the genome of each cell. In various embodiments, 
the double stranded RNA is active in the nucleus of the 
transformed cell and/or is active in the cytoplasm of the 
transformed cell. In various embodiments, at least 1,10, 
20, 50, 100, 500, or 1000 cells or all of the cells in the 
population are selected as cells that contain or express 
a double stranded RNA. In some embodiments, at least 
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1.10. 20, 50, 100, 500, or 1000 cells or all of the cells 
in the population are assayed for a modulation in the 
function of the cell, a modulation in the expression of a 
target nucleic acid (e.g., an endogenous or pathogen 
gene) in the cell, and/or a modulation in the biological 
activity of a target protein (e.g. t an endogenous or path- 
ogen protein) in the cell. 

[0041] In other embodiments, the double stranded 
RNA or double stranded RNA expression vector is com- 
plexed with one or more cationic lipids or cationic am- 
phiphiles, such as the compositions disclosed in US 
4,897,355 (Eppstein etal. t filed October 29, 1987), US 
5,264,618 (Feigner et a/., filed April 16, 1991) or US 
5,459,127 (Feigner e/aA, filed September 16, 1993). In 
other embodiments, the double stranded RNA or double 
stranded RNA expression vector is complexed with a 
liposomes/iiposomic composition that includes a cation- 
ic lipid and optionally Includes another component such 
as a neutral lipid (see, for example, US 5,279,833 
(Rose), US 5,283,185 (Epand), and US 5,932,241). In 
yet other embodiments, the double stranded RNA or 
double stranded RNA expression vector is complexed 
with any other composition that is devised by one of or- 
dinary skill in the fields of pharmaceutics and molecular 
biology. 

[0042] Desirably, the double stranded RNA specifical- 
ly hybridizes to a target nucleic acid but does not sub- 
stantially hybridize to non-target molecules, which in- 
clude other nucleic acids in the cell or biological sample 
having a sequence that is less than 99, 95, 90, 80, or 
70 % identical to or complementary to that of the target 
nucleic acid. In other embodiments, the percent de- 
crease in the expression of a target nucleic acid is at 
least 2, 5, 10, 20, or 50 fold greater than the percent 
decrease in the expression of a non-target or control nu- 
cleic acid. Desirably, the double stranded RNA inhibits 
the expression of the target nucleic acid but has negli- 
gible, If any, effect on the expression of other nucleic 
acids in the cell. 

[0043] TransformationAransf ection of the cell may oc- 
cur through a variety of means including, but not limited 
to, lipofection, DEAE-dextran-mediated transfection, 
microinjection, protoplast fusion, calcium phosphate 
precipitation, viral or retroviral delivery, electroporation, 
or biolistic transformation. The RNA or RNA expression 
vector (DNA) may be naked RNA or DNA or local anes- 
thetic complexed RNA or DNA (Pachuk e/a/., supra). In 
yet another embodiment, the cell is not a C. elegansceW. 
Desirably the vertebrate or mammalian cell has been 
cultured for onty a small number of passages {e.g. t less 
than 30 passages of a cell line that has been directly 
obtained from American Type Culture Collection), or are 
primary cells. In addition, desirably the vertebrate or 
mammalian cell is transformed with double stranded 
RNA that is not complexed with cationic lipids. 
[0044] The transcription systems described herein 
provide advantages to other double stranded expres- 
sion systems. Following transformation of the double 



stranded RNA library, cells contain hundreds to thou- 
sands of double stranded RNA expression cassettes, 
with concomitant expression of that many expression 
cassettes. In the double stranded RNA expression sys- 
5 tern of the present invention, double stranded RNA 
(dsRNA) expression cassettes contained within the ex- 
pression vector integrate into the chromosome of the 
transfected cell. Desirably, every transformed cell inte- 
grates one of the double stranded expression cassettes. 

io Through expansion of the transformed cell, episomal 
(non-integrated) expression vectors are diluted out of 
the cell overtime. Desirably no transcription occurs until 
the episomal expression vectors are diluted out of the 
cell, such that not more than 5 episomal vectors remain 

'5 jn the cell. Most desirably, no transcription occurs until 
alt of the episomal vectors have been diluted out of the 
cell, and only the integrated expression cassette re- 
mains. The time it takes for all episomal vectors to be 
removed from the cell is proportional to the replication 

20 rate of the transformed cell, and is generally on the order 
of two to several weeks of cell culture and growth. The 
numbers of copies of a dsRNA molecule in a trans- 
formed cell can be determined using, for example, 
standard PCR techniques, and thereby, the number of 

25 episomal vectors in a given cell can be monitored. 
[0045] Once a stable integrant containing five or few- 
er, and desirably no episomal expression vectors, tran- 
scription is induced, allowing dsRNA to be expressed in 
the ceils. This method ensures that, if desired, only one 

30 species or not more than about five species of dsRNA 
is expressed per cell, as opposed to other methods that 
express hundreds to thousands of double stranded spe- 
cies. 

[0046] Another problem that can occur in other double 
35 stranded expression systems or dsRNA delivery sys- 
tems is that some dsRNA sequences, possibly in certain 
cell types and through certain delivery methods, may re- 
sult in an interferon response (Jaramillo et a/., Cancer 
Invest. 13:327-338. 1995). During the induction of post- 
^0 transcriptional gene silencing events, induction of an in- 
terferon response is not desired, as this could lead to 
cell death and possibly to the prevention of gene silenc- 
ing. An additional advantage of the present invention is 
that the dsRNA delivery methods described herein are 
45 performed such that an interferon response Is inhibited 
or prevented. 

[0047] One of the components of an interferon re- 
sponse is the induction of the interferon-induced protein 
kinase PKR (Jaramillo era/., supra). Suppression of the 

so interferon response and/or the PKR response, using 
techniques described herein, is desired in the cells tar- 
geted for a PTGS event in those instances where an 
interferon response would otherwise be induced. Meth- 
ods for suppressing an interferon response or dsRNA 

55 stress response can be used in combination with any of 
the methods for identifying a nucleic acid sequence that 
modulates the function of a cell, gene expression in a 
cell, or the biological activity of a target polypeptide. 
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[004S] The methods of the present invention provide 
a means for high throughput identification of nucleic acid 
sequences involved in modulating the function of a celL 
the expression of a target nucleic acid in a cell, or the 
biological activity of a target polypeptide in a cell. By 
transforming a population of cells with a double stranded 
RNA expression library, the effects of many PTGS 
events on cell function, expression of a target nucleic 
acid in a cell, or the biological activity of a target polypep- 
tide in a cell can be evaluated simultaneously, thereby 
allowing for rapid identification of the nucleic acid se- 
quence involved in a cell function, target nucleic acid 
expression, or biological activity of a target polypeptide 
of interest. 

[0049] By "nucleic acid," "nucleic acid sequence," 
"double stranded RNA nucleic acid sequence," or "dou- 
ble stranded RNA nucleic acid" is meant a nucleic acid 
or a portion thereof that is free of the genes that, in the 
naturally-occurring genome of the organism from which 
the nucleic acid sequence of the invention is derived, 
flank the gene. The term therefore includes, for exam- 
ple, a recombinant DNA, with or without 5' or 3' flanking 
sequences that is incorporated into a vector, for exam- 
ple, a double stranded RNA expression vector; into an 
autonomously replicating plasmid or virus; or into the 
genomic DNA of a prokaryote or eukaryote; or which ex- 
ists as a separate molecule (e.g., a cDNA or a genomic 
or cDNA fragment produced by PCR or restriction en- 
donuclease digestion) independent of other sequences. 
[0050] By "double stranded RNA" is meant a nucleic 
acid containing a region of two or more nucleotides that 
are in a double stranded conformation. In various em- 
bodiments, the double stranded RNA consists entirely 
of ribonucleotides or consists of a mixture of ribonucle- 
otides and deoxynucleotides, such as the RNA/DNA hy- 
brids disclosed, for example, by WO 00/63364, filed 
April 19, 2000 or U.S.S.N. 60/130,377, filed April 21, 
1 999. The double stranded RNA may be a single mole- 
cule with a region of self-complimentarity such that nu- 
cleotides in one segment of the molecule base pair with 
nucleotides in another segment of the molecule. In var- 
ious embodiments, a double stranded RNA that consists 
of a single molecule consists entirely of ribonucleotides 
or includes a region of ribonucleotides that is compli- 
mentary to a region of deoxyrlbonucleotides. Alterna- 
tively, the double stranded RNA may include two differ- 
ent strands that have a region of complimentarity to 
each other. In various embodiments, both strands con- 
sist entirely of ribonucleotides, one strand consists en- 
tirely of ribonucleotides and one strand consists entirely 
of deoxy ribonucleotides, or one or both strands contain 
a mixture of ribonucleotides and deoxyribonucleotides. 
Desirably, the regions of complimentarity are at least 70, 
80, 90, 95, 98, or 100% complimentary. Desirably, the 
region of the double stranded RNA that is present in a 
double stranded conformation includes at least 5, 10, 
20, 30, 50, 75,100, 200, 500, 1000, 2000 or 5000 nucle- 
otides or includes ail of the nucleotides in a cDNA being 



represented in the double stranded RNA. In some em- 
bodiments, the double stranded RNA does not contain 
any single stranded regions, such as single stranded 
ends, or the double stranded RNA is a hairpin. Desirable 
5 RNA/DNA hybrids include a DNA strand or region that 
is an antisense strand or region (e.g, has at least 70, 80, 
90, 95, 98, or 100% complimentary to a target nucleic 
acid) and an RNA strand or region that is an sense 
strand or region (e.g, has at least 70, 80, 90, 95, 98, or 

10 100% identity to a target nucleic acid). In various em- 
bodiments, the RNA/DNA hybrid is made in vitro using 
enzymatic or chemical synthetic methods such as those 
described herein or those described in WO 00/63364, 
filed April 19, 2000 or U.S.S.N. 60/130,377, filed April 

is 21,1 999. In other embodiments, a DNA strand synthe- 
sized in vitro is complexed with an RNA strand made in 
vivooT in vitro before, after, or concurrent with the trans- 
formation of the DNA strand into the cell. In yet other 
embodiments, the double stranded RNA is a single cir- 

20 cular nucleic acid containing a sense and an antisense 
region, or the double stranded RNA includes a circular 
nucleic acid and either a second circular nucleic acid or 
a linear nucleic acid (see : for example, WO 00/63364, 
filed April 19, 2000 or U.S.S.N. 60/130,377, filed April 

25 21 , 1999.) Exemplary circular nucleic acids include lariat 
structures in which the free 5' phosphoryl group of a nu- 
cleotide becomes linked to the 2' hydroxyl group of an- 
other nucleotide in a loop back fashion. 
[0051] In other embodiments, the double stranded 

30 RNA includes one or more modified nucleotides in which 
the 2' position in the sugar contains a halogen (such as 
flourine group) or contains an alkoxy group (such as a 
methoxy group) which increases the half-life of the dou- 
ble stranded RNA in vitro or in vivo compared to the cor- 

35 responding double stranded RNA in which the corre- 
sponding 2' position contains a hydrogen or an hydroxyl 
group. In yet other embodiments, the double stranded 
RNA includes one or more linkages between adjacent 
nucleotides other than a naturally-occurring phosphodi- 

40 ester linkage. Examples of such linkages include phos- 
phoramide, phosphorothioate, and phosphorodithioate 
linkages. In other embodiments, the double stranded 
RNA contains one or two capped strands, as disclosed, 
for example, by WO 00763364, filed April 1 9, 2000 or U. 

45 s.S.N. 60/1 3Ci;377, filed April 21, 1999. In other embod- 
iments, the double stranded RNA contains coding se- 
quence or non-coding sequence, for example, a regu- 
latory sequence (e.g., a transcription factor binding site, 
a promoter, or a 5' or 3' untranslated region (UTR) of an 

so mRNA). Additionally, the double stranded RNA can be 
any of the at least partially double-stranded RNA mole- 
cules disclosed in WO 00/63364, filed April 19, 2000 
(see, for example, pages 8-22). Any of the double 
stranded RNAs may be expressed in vitro or in vivo us- 

55 jng the methods described herein or standard methods, 
such as those described in WO 00/63364, filed April 19, 
2000 (see, for example, pages 16-22). 
[0052] By "double stranded RNA expression library" 
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or "dsRNA expression library" is meant a collection of 
nucleic acid expression vectors containing nucleic acid 
sequences, for example, cDNA sequences or rand- 
omized nucleic acid sequences that are capable of form- 
ing a double stranded RNA (dsRNA) upon expression 
of the nucleic acid sequence. Desirably the double 
stranded RNA expression library contains at least 
10,000 unique nucleic acid sequences, more desirably 
at least 50,000; 1 00,000; or 500,000 unique nucleic acid 
sequences, and most desirably, at least 1,000,000 
unique nucleic acid sequences. By a "unique nucleic ac- 
id sequence" is meant that a nucleic acid sequence of 
a double stranded RNA expression library has desirably 
less than 50%, more desirably less than 25% or 20%, 
and most desirably less than 10% nucleic acid identity 
to another nucleic acid sequence of a double stranded 
RNA expression library when the full length sequence 
are compared. Sequence identity is typically measured 
using sequence analysis software with the default pa- 
rameters specified therein (e.g., Sequence Analysis 
Software Package of the Genetics Computer Group, 
University of Wisconsin Biotechnology Center, 1710 
University Avenue, Madison, Wl 53705). This software 
program matches similar sequences by assigning de- 
grees of homology to various substitutions, deletions, 
and other modifications. 

[0053] The preparation of cDNAs for the generation 
of double stranded RNA expression libraries is de- 
scribed herein. A randomized nucleic acid library may 
also be generated as described in detail below. The dou- 
ble stranded RNA expression library may contain nucle- 
ic acid sequences that are transcribed in the nucleus or 
that are transcribed in the cytoplasm of the cell. A double 
stranded RNA expression library may be generated us- 
ing techniques described herein. 
[0054] By "target nucleic acid" is meant a nucleic acid 
sequence whose expression is modulated as a result of 
post-transcriptional gene silencing. As used herein, the 
target nucleic acid may be in the cell in which the PTGS 
event occurs or it may be in a neighboring cell, or in a 
cell contacted with media or other extracellular fluid in 
which the cell that has undergone the PTGS event is 
contained. Exemplary target nucleic acids include nu- 
cleic acids associated with cancer or abnormal cell 
growth, such as oncogenes, and nucleic acids associ- 
ated with an autosomal dominant or recessive disorder. 
Desirably, the double stranded RNA inhibits the expres- 
sion of an allele of a nucleic acid that has a mutation 
associated with a dominant disorder and does not sub- 
stantially inhibit the other allele of the nucleic acid (e.g, 
an allele without a mutation associated with the disor- 
der). Other exemplary target nucleic acids include host 
cellular nucleic acids or pathogen nucleic acids required 
for the infection or propagation of a pathogen, such as 
a virus, bacteria, yeast, protozoa, or parasite. 
[0055] By "target polypeptide" is meant a polypeptide 
whose biological activity is modulated as a result of post- 
transcriptional gene silencing. As used herein, the target 



polypeptide may be in the cell in which the PTGS event 
occurs or it may be in a neighboring cell, or in a cell con- 
tacted with media or other extracellular fluid in which the 
cell that has undergone the PTGS event is contained. 
5 [0056] As used herein, by "randomized nucleic acids" 
is meant nucleic acids, for example, those that are at 
least 100, 500, 600, or 1000 nucleotides in length, con- 
structed from RNA isolated from a particular cell type. 
In other embodiments, the nucleic acids are at least 10, 
*o 20, 30, 40, 50, 60, 70, 80, or 90 nucleotides in length. 
In yet other embodiments : the number of nucleotides in 
the nucleic acids is between 5-100 nucleotides, 15-100 
nucleotides, 20-95 nucleotides, 25-90 nucleotides, 
35-85 nucleotides, 45-80 nucleotides, 50-75 nucle- 
us otides, or 55-70 nucleotides, inclusive. In still other em- 
bodiments, the number of nucleotides in the nucleic ac- 
ids is contained in one of the following ranges: 5-15 nu- 
cleotides, 15-20 nucleotides, 20-25 nucleotides, 25-35 
nucleotides, 35-45 nucleotides, 45-60 nucleotides, 
20 60-70 nucleotides, 70-80 nucleotides, 80-90 nucle- 
otides, or 90-1 00 nucleotides : inclusive. In otherembod- 
iments, the nucleic acids contain less than 50,000; 
10,000; 5,000; or 2,000 nucleotides. A randomized nu- 
cleic acid library may be constructed in a number of 
ways. For example, it may be constructed from existing 
cDNA libraries. In one example, the cDNA libraries are 
shuffled using the "Gene Shuffling" technology of Max- 
ygen Corp. The cDNA sequences are amplified using 
inefficient PCR either by restricting elongation time or 
through the use of manganese. A library of recom- 
binants is created, and the library is finally amplified by 
PCR and cloned into vectors. In a second method, ex- 
isting cDNA libraries are digested with an endonuclease 
to generate fragments of 10 to 300 base pairs. Alterna- 
tively, the cDNA libraries are digested to generate short- 
er fragments of, for example, 5 to 50 base pairs, 5 to 40 
base pairs, 5 to 20 base pairs, 5 to 1 0 base pairs, or 10 
to 20 base pairs, inclusive. If the fragments are to con- 
tain 5' OH and 3' P0 4 groups, they are dephosphorylat- 
ed using alkaline phosphatase and phosphorylated us- 
ing polynucleotide kinase. These dsDNA fragments are 
then ligated to form larger molecules, and are size se- 
lected. In a third example, randomized nucleic acid li- 
braries are created by using random priming of cDNA 
libraries (using random hexamers and Klenow) to gen- 
erate short fragments of 20 to 1 00 nucleotides. Alterna- 
tively, shorter fragments are generated that contain, for 
example, 5 to 50 nucleotides, 5 to 40 nucleotides, 5 to 
20 nucleotides, 5 to 10 nucleotides, or 10 to 20 nucle- 
otides, inclusive. These fragments are then ligated ran- 
domly to give a desired sized larger fragment. 
[0057] Alternatively, a randomized nucleic acid library 
can be generated from random sequences of oligonu- 
cleotides. For example, DNA or RNA oligonucleotides 
may be prepared chemically. Random DNA sequences 
may also be prepared enzymatically using terminal 
transferase in the presence of all dNTPs. Random RNA 
molecules may be prepared using NDPs and NDPphos- 
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phorylase. The random sequences may be 10 to 300 
bases in length. Alternatively, shorter random sequenc- 
es are used that contain, for example, 5 to 50 bases, 5 
to 40 bases, 5 to 20 bases, 5 to 1 0 bases, or 1 0 to 20 
bases, inclusive. The sequences are ligated to form the 
desired larger sequence using RNA ligase. Alternatively 
these sequences may be ligated chemically. The oligo- 
nucleotides are phosphorylated at the 5* position using 
polynucleotide kinase or by chemical methods, prior to 
ligation enzymatically. Chemical ligations can utilize a 5' 
P0 4 and a 3' OH group or a5' OH and a 3' P0 4 group. 
[0058] Alternatively, a randomized nucleic acid library 
can be generated by converting the random DNA se- 
quences into dsDNA sequences using DNA polymerase 
(Klenow), dNTP and random heteromeric primers, and 
the RNA sequences are converted into dsDNA se- 
quences by reverse transcriptase and Klenow. After 
converting into DNA (ss or ds) the sequences are then 
amplified by PCR. The dsDNA fragments can also be 
ligated to give larger fragments of a desired size. 
[0059] The randomized nucleic acids may be cloned 
into a vector, for example . an expression vector, as a 
double stranded RNA transcription cassette. The se- 
quence of the nucleic acid may not be known at the time 
the vector is generated. The randomized nucleic acid 
may contain coding sequence or non-coding sequence, 
for example, a regulatory sequence (e.g., a transcription 
factor binding site, a promoter, or a 5* or 3' untranslated 
region (UTR)of an mRNA). 

[0060] By "Cre-mediated double recombination" is 
meant two nucleic acid recombination events involving 
/oxPsites that are mediated by Cre recombinase. A Cre- 
mediated double recombination event can occur, for ex- 
ample, as illustrated in Fig. 1 . 

[0061] By "function of a cell' is meant any cell activity 
that can be measured or assessed. Examples of cell 
function include, but are not limited to, cell motility, ap- 
optosis, cell growth, cell invasion, vascularization, cell 
cycle events, cell differentiation, cell dedifferentiation, 
neuronal cell regeneration, and the ability of a cell to 
support viral replication. The function of a cell may also 
be to affect the function, gene expression, or the 
polypeptide biological activity of another cell, for exam- 
ple, a neighboring cell, a cell that is contacted with the 
cell, or a cell that is contacted with media or other ex- 
tracellular fluid that the cell is contained In. 
[0062] By "apoptosis" is meant a cell death pathway 
wherein a dying cell displays a set of well-characterized 
biochemical hallmarks that include cytolemmal mem- 
brane blebbing, cell soma shrinkage, chromatin con- 
densation, nuclear disintegration, and DNA laddering. 
There are many well-known assays for determining the 
apoptotic state of a cell, including, and not limited to: 
reduction of MTT tetrazolium dye, TUNEL staining, An- 
nexin V staining, propidium iodide staining, DNA ladder- 
ing, PARP cleavage, caspase activation, and assess- 
ment of cellular and nuclear morphology. Any of these 
or other known assays may be used in the methods of 



the invention to determine whether a cell is undergoing 
apoptosis. 

[0063] By "polypeptide biological activity" is meant the 
ability of a target polypeptide to modulate cell function. 
5 The level of polypeptide biological activity may be direct- 
ly measured using standard assays known in the art. 
For example, the relative level of polypeptide biological 
activity may be assessed by measuring the level of the 
mRNA that encodes the target polypeptide (e.g., by re- 
10 verse transcription-polymerase chain reaction 
(RT-PCR) amplification or Northern blot analysis); the 
level of target polypeptide (e.g., by ELISA or Western 
blot analysis); the activity of a reporter gene under the 
transcriptional regulation of a target polypeptide tran- 
'5 scriptional regulatory region (e.g., by reporter gene as- 
say, as described below); the specific interaction of a 
target polypeptide with another molecule, for example, 
a polypeptide that is activated by the target polypeptide 
or that inhibits the target polypeptide activity (e.g., by 
20 the two-hybrid assay); or the phosphorylation or glyco- 
sylation state of the target polypeptide. A compound, 
such as a dsRNA, that increases the level of the target 
polypeptide, mRNA encoding the target polypeptide, or 
reporter gene activity within a cell, a cell extract, or other 
25 experimental sample is a compound that stimulates or 
increases the biological activity of a target polypeptide. 
A compound, such as a dsRNA, that decreases the level 
of the target polypeptide, mRNA encoding the target 
polypeptide, or reporter gene activity within a cell, a cell 
extract, or other experimental sample is a compound 
that decreases the biological activity of a target polypep- 
tide. 

[0064] By "assaying" is meant analyzing the effect of 
a treatment, be it chemical or physical, administered to 
whole animals, cells, tissues, or molecules derived 
therefrom. The material being analyzed may be an an- 
imal, a cell, a tissue, a lysate or extract derived from a 
cell, or a molecule derived from a cell. The analysis may 
be, for example, for the purpose of detecting altered cell 
function, altered gene expression, altered endogenous 
RNA stability, altered polypeptide stability, altered 
polypeptide levels, or altered polypeptide biological ac- 
tivity. The means for analyzing may include, for exam- 
ple, antibody labeling, immunopreciprtation, phosphor- 
ylation assays, glycosylatlon assays, and methods 
known to those skilled in the art for detecting nucleic 
acids. In some embodiments, assaying is conducted un- 
der selective conditions. 

[0065] By "modulates" is meant changing, either by a 
decrease or an increase. As used herein, desirably a 
nucleic acid sequence decreases the function of a cell, 
the expression of a target nucleic acid in a cell, or the 
biological activity of a target polypeptide in a cell by least 
20%, more desirably by at least 30%, 40%, 50%, 60% 
or 75%, and most desirably by at least 90%. Also as 
used herein, desirably a nucleic acid sequence increas- 
es the function of a cell, the expression of a target nu- 
cleic acid in a cell, or the biological activity of a target 
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polypeptide in a cell by at least 1.5-fold to 2-fold, more 
desirably by at least 3-fold, and most desirably by at 
least 5-fold. 

[0066] By "a decrease" is meant a lowering in the level 
of: a) protein (e.g., as measured by ELISA or Western 
blot analysis); b) reporter gene activity (e.g., as meas- 
ured by reporter gene assay, for example, p-galactosi- 
dase, green fluorescent protein, or lucif erase activity); 
c) mRNA (e.g., as measured by RT-PCR or Northern 
blot analysis relative to an internal control, such as a 
"housekeeping" gene product, for example, p-actin or 
glyceraldehyde 3-phosphate dehydrogenase (GAP- 
DH)); or d) cell function, for example, as assayed by the 
number of apoptotic, mobile, growing, cell cycle arrest- 
ed, invasive, differentiated, or dedifferentiated cells in a 
test sample. In all cases, the lowering is desirably by at 
least 20%, more desirably by at least 30%, 40%, 50%, 
60%, 75%, and most desirably by at least 90%. As used 
herein, a decrease may be the direct or indirect result 
of PTGS. 

[0067] By "an increase" is meant a rise in the level of: 
a) protein (e.g., as measured by ELISA or Western blot 
analysis); b) reporter gene activity (e.g., as measured 
by reporter gene assay, for example, p-galactosidase, 
green fluorescent protein, or luciferase activity); c) mR- 
NA (e.g., as measured by RT-PCR or Northern blot anal- 
ysis relative to an internal control, such as a "house- 
keeping" gene product, for example, {J-actin or glycer- 
aldehyde 3-phosphate dehydrogenase (GAPDH)); or d) 
cell function, for example, as assayed by the number of 
apoptotic, mobile, growing, cell cycle arrested, invasive, 
differentiated, or dedifferentiated cells in a test sample. 
Desirably, the increase is by at least 1.5-fold to 2-fold, 
more desirably by at least 3-fold, and most desirably by 
at least 5-fold. As used herein, an increase may be the 
indirect result of PTGS. For example, the double strand- 
ed RNA may inhibit the expression of a protein, such as 
a suppressor protein, that would otherwise inhibitthe ex- 
pression of another nucleic acid. 
[0068] By "alteration in the level of gene expression" 
is meant a change in transcription, translation, or mRNA 
or protein stability such that the overall amount of a prod- 
uct of the gene, i.e., mRNA or polypeptide, is increased 
or decreased. 

[0069] By "reporter gene" is meant any gene that en- 
codes a product whose expression Is detectable and/or 
able to be quantitated by immunological, chemical, bio- 
chemical, or biological assays. A reporter gene product 
may, for example, have one of the following attributes, 
without restriction: fluorescence (e.g., green fluorescent 
protein), enzymatic activity (e.g., 0-galactosidase, luci- 
ferase, chloramphenicol acetyltransferase), toxicity (e. 
g., ricin A), or an ability to be specifically bound by an 
additional molecule (e.g., an unlabeled antibody, fol- 
lowed by a labelled secondary antibody, or biotin, or a 
detectably labelled antibody). It is understood that any 
engineered variants of reporter genes that are readily 
available to one skilled in the art, are also included, with- 



out restriction, in the foregoing definition. 
[0070] By "protein" or "polypeptide" or "polypeptide 
fragment" is meant any chain of more than two amino 
acids, regardless of posMranslational modification (e. 
5 9-, glycosylation or phosphorylation), constituting all or 
part of a naturally-occurring polypeptide or peptide, or 
constituting a non-naturally occurring polypeptide or 
peptide. 

[0071] By "promoter is meant a minimal sequence 
10 sufficient to direct transcription of a gene. Also included 
in this definition are those transcription control elements 
(e.g., enhancers) that are sufficient to render promoter- 
dependent gene expression controllable in a cell type- 
specific, tissue-specific, or temporal-specific manner, or 
'5 that are inducible by external signals or agents; such 
elements, which are well-known to skilled artisans, may 
be found in a 5* or 3' region of a gene or within an intron. 
Desirably a promoter is operabty linked to a nucleic acid 
sequence, for example, a cDNA or a gene in such a way 
20 as to permit expression of the nucleic acid sequence. 
[0072] By "operably linked" is meant that a gene and 
one or more transcriptional regulatory sequences, e.g., 
a promoter or enhancer, are connected in such a way 
as to permit gene expression when the appropriate mol- 
25 ecules (e.g., transcriptional activator proteins) are 
bound to the regulatory sequences. 
[0073] By "expression vector" is meant a DNA con- 
struct that contains at least one promoter operably 
linked to a downstream gene or coding region (e.g., a 
cDNA or genomic DNA fragment that encodes a protein, 
optionally, operatrvely linked to sequence lying outside 
a coding region, an antisense RNA coding region, or 
RNA sequences lying outside a coding region). Trans- 
fection or transformation of the expression vector into a 
recipient cell allows the cell to express RNA encoded by 
the expression vector. An expression vector may be a 
genetically engineered plasmid, virus, or artificial chro- 
mosome derived from, for example, a bacteriophage, 
adenovirus, retrovirus, poxvirus, or herpesvirus. 
[0074] By "transformation" or "transfection" is meant 
any method for introducing foreign molecules into a cell 
(e.g., a bacterial, yeast, fungal, algal, plant insect, or 
animal cell, particularly a vertebrate or mammalian cell). 
The cell may be in an animal. Lipofection, DEAE-dex- 
tran-medlated transfection, microinjection, protoplast 
fusion, calcium phosphate precipitation, viral or retrovi- 
ral delivery, etectroporation, and biolistic transformation 
are just a few of the transformation/transfection meth- 
ods known to those skilled in the art. The RNA or RNA 
expression vector (DNA) may be naked RNA or DNA or 
local anesthetic complexed RNA or DNA (Pachuk et al., 
supra). Other standard transfonnation/transfection 
methods and other RNA and/or DNA delivery agents (e. 
g. t a cationic lipid, liposome, or bupivacaine) are de- 
scribed in WO 00/63364, filed April 19, 2000 (see, for 
example, pages 1 8-26). Commercially available kits can 
also be used to deliver RNA or DNA to a cell. For exam- 
ple, the Transmessenger Kit from Qiagen, an RNA kit 



35 



40 



45 



50 



11 



21 



EP1 229 134 A2 



22 



from Xeragon Inc., and an RNA kit from DNA Engine 
Inc. (Seattle, WA) can be used to introduce single or 
double stranded RNA into a cell. 
[0075] By "transformed cell" or "transfected cell" is 
meant a cell (or a descendent of a cell) into which a nu- s 
cleic acid molecule, for example, a double stranded 
RNA or double stranded expression vector has been in- 
troduced, by means of recombinant nucleic acid tech- 
niques. Such cells may be either stably or transiently 
transfected. w 
[0076] By "selective conditions" is meant conditions 
under which a specific cell or group of cells can be se- 
lected for. For example, the parameters of a fluores- 
cence-activated cell sorter (FACS) can be modulated to 
identify a specific cell or group of cells. Cell panning, a *5 
technique known to those skilled in the art, is another 
method that employs selective conditions. 
[0077] As use herein, by "optimized" is meant that a 
nucleic acid fragment is generated through inefficient 
first strand synthesis {e.g., reverse transcription (RT) 20 
and/or RT/second strand synthesis (RT-SSS) using Kle- 
now or other enzymes and/or RT-PCR or PCR, to be of 
a particular length. Desirably the length of the nucleic 
acid fragment is less than a full length cDNA or is 100, 
500, 600, or 1 000 nucleotides in length. In other embod- 25 
iments, the nucleic acid fragment is at least 10, 20, 30, 
40, 50, 60, 70, 80, or 90 nucleotides in length. In yet 
other embodiments, the number of nucleotides in the 
nucleic acid fragment is between 5-100 nucleotides, 
15-100 nucleotides, 20-95 nucleotides, 25-90 nude- 30 
otides, 35-85 nucleotides, 45-80 nucleotides, 50-75 nu- 
cleotides, or 55-70 nucleotides, inclusive. In still other 
embodiments, the number of nucleotides in the nucleic 
acid fragment is contained in one of the following rang- 
es: 5-15 nucleotides, 15-20 nucleotides, 20-25 nude- 35 
otides, 25-35 nucleotides, 35-45 nucleotides, 45-60 nu- 
cleotides, 60-70 nucleotides, 70-80 nucleotides, 80-90 
nucleotides, or 90-100 nucleotides, inclusive. In other 
embodiments, the nucleic acid fragment contains less 
than 50,000; 10,000; 5,000; or 2,000 nucleotides. Opti- 40 
mization of the length of a nucleic acid can be achieved 
during first strand or second strand synthesis of a de- 
sired nucleic acid by lowering Mg++ concentrations to 
no less than the nucleotide concentrations; by adding 
Mn++ to the reaction to achieve the desired size selec- *s 
tion {e.g., by replacing Mg ++ completely, or by adding 
Mn++ at varying concentrations along with Mg+ + ); by de- 
creasing and/or limiting concentrations of dNTP(s) to ef- 
fect the desired fragment size; by using various concen- 
trations of ddNTP(s) along with standard or optimal con- so 
centrations of dNTP(s), to achieve varying ratios, to ob- 
tain the desired fragment size; by using limited and con- 
trolled exonuclease digestion of the fragment following 
RT, RT-SSS, RT-PCR, or PCR; or by a combination of 
any of these methods. 55 
[0078] As used herein, by "sized selected' is meant 
that a nucleic acid of a particular size is selected for use 
in the construction of dsRNA expression libraries as de- 



scribed herein. Desirably the size selected nucleic acid 
is less than a full length cDNA sequence or at least 1 00, 
500, 600, or 1000 nucleotides in length. In other embod- 
iments, the nucleic acid is at least 1 0, 20, 30, 40, 50, 60, 
70, 80, or 90 nucleotides in length. In yet other embod- 
iments, the number of nucleotides in the nucleic acid is 
between 5-100 nucleotides, 15-100 nucleotides, 20-95 
nucleotides, 25-90 nucleotides, 35-85 nucleotides, 
45-80 nucleotides, 50-75 nucleotides, or 55-70 nucle- 
otides, inclusive. In still other embodiments, the number 
of nucleotides in the nucleic acid is contained in one of 
the following ranges: 5-15 nucleotides, 15-20 nucle- 
otides, 20-25 nucleotides, 25-35 nucleotides, 35-45 nu- 
cleotides, 45-60 nucleotides, 60-70 nucleotides, 70-80 
nucleotides, 80-90 nucleotides, or 90-100 nucleotides, 
inclusive. In other embodiments, the nucleic acid con- 
tains less than 50,000; 10,000; 5,000; or 2,000 nucle- 
otides. For example, a nucleic acid may be size selected 
using size exclusion chromatography {e.g.,- as size ex- 
clusion Sepharose matrices) according to standard pro- 
cedures (see, for example, Sambrook, Fritsch, and Ma- 
niatis, Molecular Cloning: A Laboratory Manual (2d ed.), 
Cold Spring Harbor Laboratory Press, Cold Spring Har- 
bor, NY, 1 989). 

[0079] By "under conditions that inhibit or prevent an 
interferon response or a dsRNA stress response" is 
meant conditions that prevent or inhibit one or more in- 
terferon responses or cellular RNA stress responses in- 
volving cell toxicity, cell death, an anti-proliferative re- 
sponse, or a decreased ability of a dsRNA to carry out 
a PTGS event. These responses include, but are not 
limited to, interferon induction (both Type 1 and Type II), 
induction of one or more interferon stimulated genes, 
PKR activation, 2'5'OAS activation, and any down- 
stream cellular and/or organismal sequelae that result 
from the activation/induction of one or more of these re- 
sponses. By "organismal sequelae" is meant any effect 
(s) in a whole animal, organ, or more locally {e.g., at a 
site of injection) caused by the stress response. Exem- 
plary manifestations include elevated cytokine produc- 
tion, local inflammation, and necrosis. Desirably the 
conditions that inhibit these responses are such that not 
more than 95%, 90%, 80%, 75%, 60%, 40%, or 25%, 
and most desirably not more than 10% of the cells un- 
dergo cell toxicity, cell death, or a decreased ability to 
carry out a PTGS event, compared to a cell not exposed 
to such interferon response inhibiting conditions, all oth- 
er conditions being equal {e.g., same cell type, same 
transformation with the same dsRNA expression library. 
[0080] Apoptosis, interferon induction, 2'5' OAS acti- 
vation/induction, PKR induction/activation, anti-prolifer- 
ative responses, and cytopathic effects are all indicators 
for the RNA stress response pathway. Exemplary as- 
says that can be used to measure the induction of an 
RNA stress response as described herein include a 
TUNEL assay to detect apoptotic cells, ELISA assays 
to detect the induction of alpha, beta and gamma inter- 
feron, ribosomal RNA fragmentation analysis to detect 
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activation of 2 , 5*OAS, measurement of phosphorylated 
e!F2a as an indicator of PKR (protein kinase RNA in- 
ducible) activation, proliferation assays to detect chang- 
es in cellular proliferation, and microscopic analysis of 
cells to identify cellular cytopathic effects (see, e.g., Ex- 5 
ample 1 1 ). Desirably, the level of an interferon response 
or a dsRNA stress response in a cell transformed with 
a double stranded RNA or a double straned RNA ex- 
pression vector is less than 20, 10, 5,or 2-fold greater 
than the corresponding level in a mock-transfected con- w 
trol cell under the same conditions, as measured using 
one of the assays described herein. In other embodi- 
ments, the level of an interferon response or a dsRNA 
stress response in a cell transformed with a double 
stranded RNA or a double straned RNA expression vec- « 
tor using the methods of the present invention is less 
than 500%, 200%, 100%, 50%, 25%, or 10%, greater 
than the corresponding level in a corresponding trans- 
formed cell that is not exposed to such interferon re- 
sponse inhibiting conditions, all other conditions being 20 
equal. Desirably, the double straned RNA does not in- 
duce a global inhibition of cellular transcription or trans- 
lation. 

[0081] By □ specifically hybridizes □ is meant a dou- 
ble stranded RNA that hybridizes to a target nucleic acid 25 
but does not substantially hybridize to other nucleic ac- 
ids in a sample (e.g., a sample from a cell) that naturally 
includes the target nucleic acid, when assayed under 
denaturing conditions. In one embodiment, the amount 
of a target nucleic acidhybridized to, or associated with, 30 
the cbuble stranded RNA, as measured using standard 
assays, is 2-fold, desirably 5-fold, more desirably 
10-fold, and most desirably 50-fold greater than the 
amount of a control nucleic acid hybridized to, or asso- 
ciated with, the double stranded RNA. 35 
[0082] By Ohigh stringency conditions Dis meant hy- 
bridization in 2X SSC at 40°C with a DNA probe length 
of at least 40 nucleotides. For other definitions of high 
stringency conditions, see F. Ausubel et a!., Current Pro- 
tocols in Molecular Biology, pp. 6.3.1-6.3.6, John Wiley *o 
& Sons, New York, NY 1994, hereby incorporated by 
reference. 

[0083] Conditions and techniques that can be used to 
prevent an interferon response or dsRNA stress re- 
sponse during the screening methods of the present in- 45 
ventlon are described herein. 

Brief Description of the Drawing 

[0084] Fig. 1 is a schematic representation of a strat- so 
egy to isolate clonally pure stable integrants that contain 
a single expression unit isolated from cells transfected 
with a double-stranded RNA encoding a cDNA library. 
[0085] Fig. 2 is a schematic illustration of the produc- 
tion of effector RNAs in cells expressing PSA. The PSA 55 
expression cassette used to create the transient PSA 
expression cell line is depicted at the top of the figure. 
Expression of PSA is driven by the HCMV IE promoter 



and the SV40 polyadenylation signal (pA). Only se- 
quences 3' of the PSA initiation codon have been used 
in these vectors. The effector RNA expression cassettes 
are shown below the PSA expression cassette and are 
designed to express PSA sense RNA, PSA antisense 
RNA, and PSA dsRNA. Expression of the effector RNAs 
is under the control of the T7 promoter (T7p). Transcrip- 
tion from T7p is catalyzed by T7 RNA polymerase, which 
is supplied by co-transfecting a T7 RNA polymerase ex- 
pression plasmid (not shown). Control effector RNA 
cassettes expressing irrelevant RNAs derived from the 
Herpes simplex virus glycoprotein D gene were included 
as controls. The 600 base pair sequence from the Her- 
pes simplex gD gene is from Herpes Simplex virus 2 
strain 12 and maps to the coding region downstream of 
the gD initiation codon. 

[0086] Fig. 3 is a bar graph illustrating silencing of 
PSA expression by dsRNA. PSA levels in the super- 
nates of transfected cells were determined by EL ISA 
and are plotted as percent expression of the PSA un- 
treated control. The PSA untreated control shown at the 
left is normalized to 100%. PSA levels in the supernates 
of cells transfected with the various effector PSA or con- 
trol RNAs are shown by the shaded and open bars re- 
spectively. All data shown is from day two post-transfec- 
tion. Data from later time points were similar to the day 
two time point. 

[0087] Fig. 4 is a schematic illustration of the RNA 
stress response pathway, also known as the Type 1 in- 
terferon response. 

Detailed Description of the Invention 

[0088] Post-transcriptional gene silencing (PTGS) 
can be used as a tool to identify and validate specific 
unknown genes involved in cell function, gene expres- 
sion, and polypeptide biological activity. Although the 
use of PTGS as a validation strategy is well documented 
in invertebrates and plants, its use in identification of 
genes that modulate cell function, gene expression, or 
polypeptide biological activity, as described below, is 
novel. Since novel genes are likely to be identified 
through the methods of the present invention, PTGS is 
developed for use in validation and to identify novel tar- 
gets for use in therapies for diseases, for example, can- 
cer, neurological disorders, obesity, leukemia, lympho- 
mas, and other disorders of the blood or immune sys- 
tem. 

[0089] The present invention features methods to 
identify unknown targets that result in the modulation of 
a particular phenotype, an alteration of gene expression 
in a cell, or an alteration in polypeptide biological activity 
in a cell, using either a library based screening approach 
or a non-library based approach to identify nucleic acids 
that induce gene silencing. The present invention also 
allows the determination of function of a given se- 
quence. These methods involve the direct delivery of in 
wfrotranscribed double stranded RNA (dsRNA), as well 
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as plasmid-based systems that direct the cell to make 
its own dsRNA. To avoid problems associated with 
transfection efficiency, plasmids are designed to contain 
a selectable marker to ensure the survival of only those 
cells that have taken up plasmid DNA. One group of 
plasmids directs the synthesis of dsRNA that is tran- 
scribed in the cytoplasm, while another group directs the 
synthesis of dsRNA that is transcribed in the nucleus. 

Identification of genes by assaying for a modulation in 
cell function 

[0090] Functional identification of novel genes can be 
accomplished through the use of a number of different 
assays. For example, cells may be assayed for cell mo- 
tility, apoptosis, cell growth, cell invasion, vasculariza- 
tion, cell cycle events, cell differentiation, cell dediffer- 
entiation, neuronal cell regeneration, or the ability to 
support viral replication, as well as other cell functions 
known in the art. Methods for carrying out such function- 
al assays are well known and are described, for exam- 
ple, in Platet and Garcia (Invasion Metastasis 18: 
198-208, 1998-1999); Harper et al. (Neuroscience 88: 
257-267, 1999); and Tomaselli et al. (J. Cell Biol. 105: 
2347-2358, 1987), and are also described below. 
[0091] Functional identification of nucleic acid se- 
quences involved in modulating a particular cell function 
may be carried out by comparing cells transfected with 
a dsRN A to control cells that have not been transformed 
with a dsRNA or that have been mock-transfected, in a 
functional assay. A cell that has taken up sequences un- 
related to a particular function will perform in the partic- 
ular assay in a manner similar to the control cell. A cell 
experiencing PTGS of a gene involved in the particular 
function will exhibit an altered ability to perform in the 
functional assay compared to the control. 
[0092] The percent modulation of a particular cell 
function that identifies a nucleic acid sequence that 
modulates the function of a cell will vary depending on 
the assay, phenotype, and the particular nucleic acid af- 
fected by PTGS. For each assay, the percent modula- 
tion can readily be determined by one skilled in the art, 
when used in conjunction with controls, as described 
herein. Desirably the modulation is at least 20%, more 
desirably at least 30%, 40%, 50%, 60%, 75%, and most 
desirably at least 90% compared to the control. An In- 
crease in the function of a ceil can also be measured in 
terms of fold increase, where desirably, the increase is 
at least 1 .5-fold to 5-fold compared to the control. 
[0093] Alternatively, the function of a cell may be to 
affect the function, gene expression, or polypeptide bi- 
ological activity of another cell, for example, a neighbor- 
ing cell, a cell that is contacted with the cell in which a 
PTGS event occurs, or a cell that is contacted with me- 
dia or other extracellular fluid that the cell in which a 
PTGS event occurs is contained in. For example, a cell 
experiencing PTGS of a gene may modufate cell motil- 
ity, apoptosis, cell growth, cell invasion, vascularization, 



cell cycle events, cell differentiation, cell dedifferentia- 
tion, neuronal cell regeneration, or the ability to support 
viral replication of a nearby cell, or a cell that is exposed 
to media or other extracellular fluid in which the trans- 

5 fected cell in which a PTGS event occurs was once con- 
tained. This can be tested by removing the media in 
which a cell experiencing a PTGS event is occurring and 
placing it on a separate cell or population of cells. If the 
function of the separate cell or population of cells is mod- 

10 ulated, compared to a cell or population of cells receiv- 
ing media obtained from cells that had been mock trans- 
fected, then one or more of the cells experiencing a 
PTGS event can affect the function of another cell. The 
identity of the nucleic acid sequence that causes the 

'5 modulation can be identified with repeated rounds of se- 
lection. 

[0094] In another method, a single cell experiencing 
a PTGS event can be placed in proximity of a cell or a 
population of cells that was not transfected with dsRNA, 

20 and the effect of this placement is evaluated for a mod- 
ulation in the function of the cell or population of cells. 
If the function of the n on -transfected cell or population 
of cells is modulated, compared to a cell or population 
of cells in proximity of a cell that was mock transfected, 

25 then the cell experiencing a PTGS event contains a nu- 
cleic acid sequence that can affect the function of an- 
other cell. This nucleic acid sequence can be identified 
using techniques described herein. 

30 Identification of genes using differential gene 
expression 

[0095] Differential gene expression analysis can be 
used to identify a nucleic acid sequence that modulates 

35 the expression of a target nucleic acid in a cell. Altera- 
tions in gene expression induced by gene silencing can 
be monitored in a cell into which a dsRNA has been in- 
troduced. For example, differential gene expression can 
be assayed by comparing nucleic acids expressed in 

^o cells into which dsRNA has been introduced to nucleic 
acids expressed in control cells that were not transfect- 
ed with dsRNA or that were mock-transfected. Gene ar- 
ray technology can be used in order to simultaneously 
examine the expression levels of many different nucleic 

^5 acids. Examples of methods for such expression anal- 
ysis are described by Marrack et al. (Current Opinions 
in Immunology 12:206-209, 2000); Haricin (Oncologist 
5:501 -507, 2000); Pelizzari era/. (Nucleic Acids Res. 28: 
4577-4581, 2000); and Marx (Science 289:1670-1672, 

50 2000). 

Identification of genes by assaying polypeptide 
biological activity 

55 [0096] Novel nucleic acid sequences that modulate 
the biological activity of a target polypeptide can also be 
identified by examining polypeptide biological activity. 
Various polypeptide biological activities can be evaluat- 
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ed to identify novel genes according to the methods of 
the invention. For example, the expression of a target 
polypeptide(s) may be examined. Alternatively, the in- 
teraction between a target polypeptide(s) and another 
molecule(s), for example, another polypeptide or a nu- 
cleic acid may be assayed. Phosphorylation or glyco- 
sylation of a target polypeptide(s) may also be as- 
sessed, using standard methods known to those skilled 
in the art. 

[0097] Identification of nucleic acid sequences in- 
volved in modulating the biological activity of a target 
polypeptide may be carried out by comparing the 
polypeptide biological activity of a cell transfected with 
a dsRNA to a control cell that has not been transfected 
with a dsRNA or that has been mock-transfected. A cell 
that has taken up sequences unrelated to a particular 
polypeptide biological activity will perform in the partic- 
ular assay in a manner similar to the control cell. A cell 
experiencing PTGS of a gene involved in the particular 
polypeptide biological activity will exhibit an altered abil- 
ity to perform in the biological assay, compared to the 
control. 

Insertion of single units into the chromosome and 
generation of a cell line containing a single dsRNA 
expression library integrant 

[0098] The present invention involves the generation 
of a target cell line in which the dsRNA expression library 
is subsequently introduced. Through the use of site-spe- 
cific recombination, single integrants of dsRNA expres- 
sion cassettes are generated at the same locus of all 
cells in the target cell line, allowing uniform expression 
of the dsRNA in all of the integrants. A dsRNA expres- 
sion library derived from various cell lines is used to cre- 
ate a representative library of stably integrated cells, 
each cell within the target cell line containing a single 
integrant. Cre/tax, Lambda-Cro repressor, and Flp re- 
combinase systems or retroviruses are used to gener- 
ate these singular integrants of dsRNA expression cas- 
settes in the target cell line (Satoh et aL, J. Virol. 74: 
10631-10638, 2000; Trinh et aL, J. Immunol. Methods 
244:185-193, 2000; Serov et aL, An. Acad. Bras. Cienc. 
72:389-398, 2000; Grez ef a/., Stem Cells. 16:235-243, 
1 998; Habu etaL, Nucleic Acids Symp. Ser. 42:295-296, 
1999; Haren et a/., Annu. Rev. Microbiol. 53:245-281, 
1999; Baer et aJ. t Biochemistry 39:7041-7049, 2000; 
Follenzi et af. t Nat. Genet. 25:217-222, 2000; Hind- 
marsh et aL, Microbiol. Mol. Biol. Rev. 63:836-843, 
1999; Darquet ef aL, GeneTher. 6:209-218, 1999; Dar- 
quet era/., Gene Ther. 6:209-21 8, 1 999; Yu et aL, Gene 
223:77-81, 1998; Darquet et aL, Gene Ther. 4: 
1341-1349, 1997; and Koch etal., Gene 249:135-144, 
2000). These systems are used singularly to generate 
singular insertion clones, and also in combination. 
[0099] The following exemplary sequence specific in- 
tegrative systems use short target sequences that allow 
targeted recombination to be achieved using specific 



proteins: FLP recombinase, bacteriophage Lambda in- 
tegrase, HIV integrase, and pilin recombinase of Salmo- 
nella (Seng et af. Construction of a Flp "exchange cas- 
sette" contained vector and gene targeting in mouse ES 
5 cell] A book chapter PUBMED entry 11797223 - Sheng 
Wu Gong Cheng Xue Bao. 2001 Sep;1 7(5):566-9., Liu 
etaL, Nat Genet. 2001 Jan 1;30(1):66-72., Awatramani 
e/a/.,NatGenet.2001 Nov;29(3):257-9., Heidmannand 
Lehner, Dev Genes Evol. 2001 Sep;211(8-9):458-65, 
10 Schaft ef a/., Genesis. 2001 Sep;31{1):6-10, Van 
Duyne, Annu Rev Biophys Biomol Struct. 2001 ;30: 
87-104., Lorbach etaL, J Mol Biol. 2000 Mar 10;296(5): 
1175-81., Darquet et aL, Gene Ther. 1999 Feb;6(2): 
209-18., Bushman and Miller, J Virol. 1997 Jan;71(1): 

15 458-64., Fulks et aL, J Bacterid. 1990 Jan;172(1): . 
310-6). A singular integrant is produced by randomly in- 
serting the specific sequence {e.g., loxP in the ere re- 
combinase system) and selecting or identifying the cell 
that contains a singular integrant that supports maximal 

20 expression. For example, integrants that show maximal 
expression following random integration can be identi- 
fied through the use of reporter gene sequences asso- 
ciated with the integrated sequence. The cell can be 
used to specifically insert the expression cassette into 

25 the site that contains the target sequence using the spe- 
cific recombinase, and possibly also remove the expres- 
sion cassette that was originally placed to identify the 
maximally expressing chromosomal location. A skilled 
artisan can also produce singular integrants using ret- 

30 roviral vectors, which integrate randomly and singularly 
into the eukaryotic genome. In particular, singular inte- 
grants can be produced by inserting retroviral vectors 
that have been engineered to contain the desired ex- 
pression cassette into a naive cell and selecting for the 

35 chromosomal location that results in maximal expres- 
sion (Michael ef al., EMBO Journal, vol 20: pages 
2224-2235, 2001; Reik and Murrell., Nature, vol. 405, 
page 408-409, 2000; Berger etal., Molecular Cell, vol. 
8, pages 263-268). One may also produce a singular 

40 integrant by cotransfecting the bacterial RecA protein 
with or without nuclear localization signal along with se- 
quences that are homologous to the target sequence (e. 
g., a target endogenous sequence or integrated trans- 
gene sequence). Alternatively, a nucleic acid sequence 

45 that encodes a RecA protein with nuclear localization 
signals can be cotransfected (Shibata et aL, Proc Natl 
Acad Sci U S A. 2001 Jul 17;98(15):8425-32. Review., 
Muyrers ef aL, Trends Biochem Sci. 2001 May;26(5): 
325-31., Paul efaA.Mutat Res. 2001 Jun5;486(1):11-9., 

50 Shchertoakova ef aL, Mutat Res. 2000 Feb 16;459(1): 
65-71., Lantsov. Mol Biol (Mosk). 1994 May-Jun;28(3): 
485-95). 

[0100] An example utilizing such methods is detailed 
below. 

55 

Creation of the target cell line 

[01 01 ] Target cell lines are the same cell lines as the 
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ones from which the dsRNA expression libraries will be 
derived. Target cells are created by transfecting the se- 
lected cell line with a bicistronic plasmid expressing a 
selectable marker, such as G41 8 and the reporter gene 
GFP. The plasmid also bears a /oxPsite. Plasmids inte- 5 
grate randomly into the chromosome through the proc- 
ess of illegitimate recombination at a frequency of 10~ 4 . 
Following transfection, cells containing integrants are 
selected by culturing the cells in the presence of G418 
at a concentration determined earlier in a kill curve anal- 10 
ysis. About a dozen G418-resistant colonies are ex- 
panded and relative GFP expression levels are deter- 
mined using flow cytometry. DNA from the cells is ana- 
lyzed by Southern blot analysis to determine integrant 
copy number. Several single copy integrants exhibiting 15 
the highest GFP expression levels are then selected as 
the target cell lines. GFP expression is monitored be- 
cause dsRNA encoding templates are then integrated 
into the loci containing the foxP, GFP, and G418 cas- 
settes in a site-specific fashion, and it is important to 20 
ensure that these loci are transcriptionally active. Since 
cells are selected on the basis of G418 resistance and 
GFP expression, integration of the plasmid DNA can oc- 
cur at the /oxPsite, destroying its function. Several cell 
lines are therefore chosen to reasonably ensure that at 25 
least one integrant has an intact loxP site. 

Double stranded RNA expression library construction 
and site-specific recombination into the target celt line 

30 

[01 02] A cDNA library or a randomized library is con- 
structed from RNA isolated from selected cell lines. cD- 
NAs or randomized nucleic acids in the size range of at 
least 100 to 1000 nucleotides, for example, 500 to 600 
nucleotides are optimized during synthesis or are size- 35 
selected prior to cloning. In other embodiments, the nu- 
cleic acids are at least 10, 20, 30, 40, 50, 60, 70, 80, or 
90 nucleotides in length. In yet other embodiments, the 
number of nucleotides in the nucleic acids is between 
5-100 nucleotides, 15-100 nucleotides, 20-95 nude- *o 
otides, 25-90 nucleotides, 35-85 nucleotides, 45-80 nu- 
cleotides, 50-75 nucleotides, or 55-70 nucleotides, in- 
clusive. In still other embodiments, the number of nucle- 
otides in the nucleic acids is contained in one of the fol- 
lowing ranges: 5-15 nucleotides, 15-20 nucleotides, 45 
20-25 nucleotides, 25-35 nucleotides, 35-45 nucle- 
otides, 45-60 nucleotides, 60-70 nucleotides, 70-80 nu- 
cleotides, 80-90 nucleotides, or 90-1 00 nucleotides, in- 
clusive. In other embodiments, the nucleic acid contains 
less than 50,000; 10,000; 5,000; or 2,000 nucleotides, so 
Each cDNA or randomized nucleic acid is then cloned 
into a plasmid vector as a dsRNA transcription cassette 
flanked by two convergent promoters (such as T7 pro- 
moters as described herein). The promoters are tran- 
scriptionally regulated such that they are off until in- 55 
duced, for example, using a tet ON/OFF system (Forster 
et a/., Nucleic Acids Res. 27:7708-710, 1999; Liu etaL, 
Biotechniques 24:624-628, 6,30-632, 1998; and Gatz, 



Methods Cell Biol. 50:411-424, 1995). The plasmid also 
contains the hygromycin resistance gene and an invert- 
ed loxP site. The cDNA plasmid library or randomized 
plasmid library is then co-transfected into the target cell 
line with a plasmid expressing Cre recombinase, which 
catalyzes site-specific recombination of the transfected 
cDNA plasmid or randomized nucleic acid plasmid at the 
inverted loxPsiXe into the chromosomal locus containing 
the GFP gene and toxPsite (see Fig. 1). The use of the 
Cre/lox system allows the efficient integration of a plas- 
mid into the chromosome (every transfected cell is pre- 
dicted to undergo a plasmid integration event). Other 
site-specific recombination strategies can also be uti- 
lized. This results in having every integration to occur at 
the same site : thereby obviating potential problems with 
loci dependent expression. 

[01 03] Two days following transfection, cells are incu- 
bated in the presence of hygromycin to kill untransfected 
cells and to select for stable integrants. Transcription of 
dsRNA is induced, and selected cells are assayed for 
an alteration in cell function, the biological activity of a 
target polypeptide, or differential gene expression. Cells 
expressing dsRNA corresponding to a target nucleic ac- 
id exhibit an altered function, for example, increased or 
decreased cell invasion, motility, apoptosis, growth, dif- 
ferentiation, dedifferentiation, or regeneration, or the 
ability of the cell to support viral replication. Cells exhib- 
iting altered function are then expanded and the se- 
quence of the integrant is determined. Targets are iden- 
tified and validated using dsRNA specific for the identi- 
fied target, or other non-PTGS mediated methods, for 
example antisense technology. 
[0104] The regulated transcription system of the 
present invention provides an advantage to other dou- 
ble stranded expression systems. Following transfec- 
tion of the dsRNA library, cells contain hundreds to thou- 
sands of dsRNA expression cassettes, with concomi- 
tant expression of that many expression cassettes. In 
the dsRNA expression system of the present invention, 
dsRNA expression cassettes contained within the ex- 
pression vector integrate into the chromosome of the 
transfected cell. As described in detail below, every 
transfected cell integrates one of the double stranded 
expression cassettes. Desirably no transcription occurs 
until the episomal (non-Integrated) expression vectors 
are diluted out of the cell such that not more than 5 ep- 
isomal vectors remain in the cell. Most desirably no tran- 
scription occurs until all of the episomal (non-integrated) 
expression vectors are diluted out of the cell and only 
the integrated expression cassette remains (a process 
usually taking about two to several weeks of cell cul- 
ture). At this time transcription is induced, allowing dsR- 
NA to be expressed in the cells. This method ensures 
that only one species of dsRNA is expressed per cell, 
as opposed to other methods that express hundreds to 
thousands of double stranded species. The use of the 
above-described system results In the loss of all but one 
expression cassette, which in turn, permits the rapid 
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screening of libraries without requiring screening multi- 
ple pools of libraries to identify the target gene. 

Non-library approaches for the identification of a nucleic 
acid sequence that modulates cell function, gene 
expression in a cell, or the biological activity of a target 
polypeptide in a cell through PTGS 

[0105] Nucleic acid sequences that modulate cell 
function, gene expression in a cell, or the biological ac- 
tivity of a target polypeptide in a cell may also be iden- 
tified using non-library based approaches involving 
PTGS. For example, a single known nucleic acid se- 
quence encoding a polypeptide with unknown function 
or a single nucleic acid fragment of unknown sequence 
and/or function can be made into a double stranded 
RNA molecule. This dsRNA is then transfected Into a 
desired cell type and the cell is assayed for modulations 
in cell function, gene expression of a target nucleic acid 
in the cell, or the biological activity of a target polypep- 
tide in the cell, using methods described herein. A mod- 
ulation in cell function, gene expression in the cell, or 
the biological activity of a target polypeptide in the cell 
identifies the nucleic acid of the dsRNA as a nucleic acid 
the modulates the specific cell function, gene expres- 
sion, or the biological activity of a target polypeptide. As 
a single dsRNA species is transfected into the cells, the 
nucleic acid sequence responsible for the modulation is 
readily identified. This non-library based approach to 
nucleic acid identification is desirably used under con- 
ditions that inhibit an interferon response or dsRNA 
stress response. Such conditions are described in detail 
herein. 

[0106] The discovery of novel genes through the 
methods of the present invention may lead to the gen- 
eration of novel therapeutics. For example, genes that 
decrease cell invasion may be used as targets for drug 
development, such as for the development of cytostatic 
therapeutics for use in the treatment of cancer. Devel- 
opment of such therapeutics is important because cur- 
rently available cytotoxic anticancer agents are also tox- 
ic for normal rapidly dividing cells. In contrast, a cyto- 
static agent may only need to check metastatic process- 
es, and by inference, slow cell growth, in order to stabi- 
lize the disease. In another example, genes that in- 
crease neuronal regeneration may be used to develop 
therapeutics for the treatment, prevention, or control of 
a number of neurological diseases, including Alzheim- 
er's disease and Parkinson's disease. Genes that are 
involved in the ability to support viral replication and be 
used as targets in anti-viral therapies. Such therapies 
may be used to treat, prevent, or control viral diseases 
involving human immunodeficiency virus (HIV), hepati- 
tis C virus (HCV), hepatitis B virus (HBV), and human 
papillomavirus (HP V). The efficacies of therapeutics tar- 
geting the genes identified according to the present in- 
vention can be further tested in cell culture assays, as 
well as in animal models. 



The use of vertebrate or mammalian cell lines for 
identification of nucleic acid sequences that modulate 
cell function, expression of a target nucleic acid or 
biological activity of a target polypeptide 

5 

[0107] While the use of the present invention is not 
limited to vertebrate or mammalian cells, such cells can 
be used to carry out the nucleic acid identification meth- 
ods described herein. Desirably the vertebrate or mam- 

10 malian cells used to carry out the present invention are 
cells that have been cultured for only a small number of 
passages (e.g., less than 30 passages of a cell line that 
has been obtained directly from American Type Culture 
Collection), or are primary cells. In addition, vertebrate 

'5 or mammalian cells can be used to carry out the present 
invention when the dsRNA being transfected into the 
cell is not complexed with cationic lipids. 
[01 08] The following examples are to illustrate the in- 
vention. They are not meant to limit the invention in any 

20 way. For example, it is noted that any of the following 
examples can be used with double stranded RNAs of 
any length. The methods of the present invention can 
be readily adapted by one skilled in the art to utilize dou- 
ble stranded RNAs of any desired length. 

25 

Example 1 : Design and delivery of vectors for 
intracellular synthesis of dsRNA for library based 
screening approaches to nucleic acid identification 
using PTGS 

[0109] PTGS is induced when dsRNA is made intra- 
cellularly. The library based screening approaches to 
nucleic acid identification through PTGS may require 
that dsRNA reside in certain cellular compartments in 
order to exert its effect. Therefore, expression plasmids 
that transcribe dsRNA in the cytoplasm and in the nu- 
cleus are utilized. There are two classes of nuclear tran- 
scription vectors: one that is designed to express poly- 
adenylated dsRNA (for example, a vector containing an 
RNA polymerase II promoter and a poty A site) and one 
that expresses non-adenylated dsRNA (for example, a 
vector containing an RNA polymerase II promoter and 
no poly A site, or a vector containing a T7 promoter). 
Different cellular distributions are predicted for the two 
species of RNA; both vectors are transcribed in the nu- 
cleus, but the ultimate destinations of the RNA species 
are different intracellular locations. Intracellular tran- 
scription may also utilize bacteriophage T7 and SP6 
RNA polymerase, which may be designed to transcribe 
in the cytoplasm or in the nucleus. Alternatively, Qbeta 
replicase RNA-dependent RNA polymerase may be 
used to amplify dsRNA. Viral RNA polymerases, either 
DNA and RNA dependent, may also be used. Alterna- 
tively, dsRNA replicating polymerases can be used. Cel- 
lular polymerases such as RNA Polymerase I, II, or III 
or mitochondrial RNA polymerase may also be utilized. 
Both the cytoplasmic and nuclear transcription vectors 
contain an antibiotic resistance gene to enable selection 
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of cells that have taken up the plasmid. Cloning strate- 
gies employ chain reaction cloning (CRC), a one-step 
method for directional ligation of multiple fragments (Pa- 
chuk etai, Gene 243:19-25, 2000). Briefly, the ligations 
utilize bridge oligonucleotides to align the DNA frag- 
ments in a particular order and ligation is catalyzed by 
a heat-stable DNA ligase, such as Ampligase, available 
from Epicentre. 

inducible or repressive transcription vectors lor the 
generation of a dsRNA expression library 

[01 1 0] If desired, inducible and repressible transcrip- 
tion systems can be used to control the timing of the 
synthesis of dsRNA. Inducible and repressible regula- 
tory systems involve the use of promoter elements that 
contain sequences that bind prokaryotic or eukaryotic 
transcription factors upstream of the sequence encod- 
ing dsRNA. In addition, these factors also carry protein 
domains that transactivate or transrepress the RNA 
polymerase II . The regulatory system also has the ability 
to bind a small molecule (e.g., a coinducer or a core- 
pressor). The binding of the small molecule to the reg- 
ulatory protein molecule (e.g., a transcription factor) re- 
sults in either increased or decreased affinity for the se- 
quence element. Both inducible and repressible sys- 
tems can be developed using any of the inducer/tran- 
scription factor combinations by positioning the binding 
site appropriately with respect to the promoter se- 
quence. Examples of previously described inducible/re- 
pressible systems include lacl, ara, Steroid-RU486, and 
ecdysone - Rheogene, Lac (Cronin et at. Genes & De- 
velopment 15: 1506-1517, 2001), ara (KhlebnikovetaL, 
J Bacterid. 2000 Dec; 182(24): 7029-34), ecdysone 
(Rheogene, www.rheogene.com), RU48 (steroid, Wang 
XJ, Liefer KM, Tsai S, O'Malley BW, Roop DR., Proc Natl 
Acad Sci U S A. 1 999 Jul 20;96(1 5):8483-8), re/promot- 
er (Rendal et a/ M Hum Gene Then 2002 Jan;13(2): 
335-42. and Larnartina et a/., Hum Gene Ther. 2002 
Jan; 13(2):1 99-210), or a promoter disclosed in WO 
00/63364, filed April 19, 2000. 

Nuclear transcription vectors for the generation of a 
nuclear dsRNA expression library 

[0111] Nuclear transcription vectors for use in library 
based screening approaches to identify nucleic acids 
that modulate cell function, gene expression, or the bi- 
ological activity of a target polypeptide are designed 
such that the target sequence is flanked on one end by 
an RNA polymerase II promoter (for example, the HC- 
MV-IE promoter) and on the other end by a different 
RNA polymerase II promoter (for example, the SCMV 
promoter). Other promoters that can be used include 
other RNA porymerase II promoters, an RNA polymer- 
ase I promoter, an RNA porymerase III promoter, a mi- 
tochondrial RNA polymerase promoter, or a T7 or SP6 
promoter in the presence of T7 or SP6 RNA polymerase, 



respectively, containing a nuclear localization signal. 
Bacteriophage or viral promoters may also be used. The 
promoters are regulated transcriptionally (for example, 
using a tet ON/OFF system (Forster et ai. supra; Liu et 
s at., supra; and Gatz, supra) such that they are only ac- 
tive in either the presence of a transcription-inducing 
agent or upon the removal of a repressor. A single chro- 
mosomal integrant is selected for, and transcription is 
induced in the cell to produce the nuclear dsRNA. 
10 [0112] Those vectors containing a promoter recog- 
nized by RNA Pol I, RNA Pol II, or a viral promoter in 
conjunction with co-expressed proteins that recognize 
the viral promoter, may also contain optional sequences 
located between each promoter and the inserted cDNA. 

is These sequences are transcribed and are designed to 
prevent the possible translation of a transcribed cDNA. 
For example, the transcribed RNA is synthesized to con- 
tain a stable stem- loop structure at the 5' end to impede 
ribosome scanning. Alternatively, the exact sequence is 

20 irrelevant as long as the length of the sequence is suf- 
ficient to be detrimental to translation initiation (e.g., the 
sequence is 200 nucleotides or longer). The RNA se- 
quences can optionally have sequences that allow 
polyA addition, intronic sequences, an HIV REV binding 

25 sequence, Mason-Pfizer monkey virus constitutive 
transport element(CTE) (U.S. 5 : 880,276, filed April 25, 
1996), and/or self splicing intronic sequences. 
[0113] To generate dsRNA, two promoters can be 
placed on either side of the target sequence, such that 

30 the direction of transcription from each promoter is op- 
posing each other. Alternatively, two plasmids can be 
cotransfected. One of the plasmids is designed to tran- 
scribe one strand of the target sequence while the other 
is designed to transcribe the other strand. Single pro- 

35 moter constructs may be developed such that two units 
of the target sequence are transcribed in tandem, such 
that the second unit is in the reverse orientation with re- 
spect to the other. Alternate strategies include the use 
of filler sequences between the tandem target sequenc- 

40 es. 

Cytoplasmic transcription vectors for the generation of 
a cytoplasmic dsRNA expression library 

« [0114] Cytoplasmic transcription vectors for use in li- 
brary based screening approaches to identifying nucleic 
acids that modulate cell function, gene expression, or 
the biological activity of a target polypeptide in a cell us- 
ing PTGS are made according to the following method. 

50 This approach involves the transcription of a single 
stranded RNA template (derived from a library) in the 
nucleus, which is then transported into the cytoplasm 
where it serves as a template for the transcription of 
dsRNA molecules. The DNA encoding the ssRNA is in- 

55 tegrated at a single site in the target cell line as de- 
scribed for the nuclear RNA expression library, thereby 
ensuring the synthesis of only one species of dsRNA in 
a cell, each cell expressing a different dsRNA species. 
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[0115] A desirable approach is to use endogenous 
polymerases such as the mitochondrial polymerase in 
animal cells or mitochondrial and chloroplast polymer- 
ases in plant cells for cytoplasmic and mitochondrial (e. 
g.. chloroplast) expression to make dsRNA in the cyto- 
plasm. These vectors are formed by designing expres- 
sion constructs that contain mitochondrial or chloroplast 
promoters upstream of the target sequence. As de- 
scribed above for nuclear transcription vectors, dsRNA 
can be generated using two such promoters placed on 
either side of the target sequence, such that the direc- 
tion of transcription from each promoter is opposing 
each other. Alternatively, two plasmids can be cotrans- 
fected. One of the plasmids is designed to transcribe 
one strand of the target sequence while the other is de- 
signed to transcribe the other strand. Single promoter 
constructs may be developed such that two units of the 
target sequence are transcribed in tandem, such that 
the second unit is in the reverse orientation with respect 
to the other. Alternate strategies include the use of filler 
sequences between the tandem target sequences. 
[0116] Alternatively, cytoplasmic expression of dsR- 
NA for use in library based screening approaches is 
achieved by a single subgenomic promoter opposite in 
orientation with respect to the nuclear promoter. The nu- 
clear promoter generates one RNA strand that is trans- 
ported into the cytoplasm, and the singular subgenomic 
promoter at the 3' end of the transcript is sufficient to 
generate its antisense copy by an RNA dependent RNA 
polymerase to result in a cytoplasmic dsRNA species. 

Target cell line development for use with cytoplasmic 
dsRNA expression libraries 

[01 1 7] The target cell line, using the vector containing 
the G418 cassette, GFP, and loxP site is designed as 
described above. 

Development of a cytoplasmic dsRNA expression 
library 

[0118] Double stranded RNA expression libraries are 
generated by inserting cDNA or randomized sequences 
(as described herein) into an expression vector contain- 
ing a single nuclear promoter (RNA polymerase I, RNA 
polymerase II, or RNA polymerase III), which allows 
transcription of the insert sequence. It is desirable that 
this nuclear promoter activity is regulated transcription- 
ally (for example, using a tet ON/OFF system described, 
for example, by Forster et aJ., supra; Liu et a/., supra; 
and Gatz, supra), such that the promoters are only ac- 
tive in either the presence of a transcription-inducing 
agent or upon the removal of a repressor. This ensures 
that transcription is not induced until episomal copies of 
the vector(s) are diluted out. Vectors also contain a se- 
lectable marker, such as the hygromycin resistance 
gene, and a loxPsite. The expression vectors are inte- 
grated into the target cell line by methods previously de- 



scribed in this application using Cre recombinase (other 
site-specific recornbi native strategies can be employed, 
as described previously). 

[0119] At two days post-transfection, cells are sub- 

5 jected to hygromycin selection using concentrations es- 
tablished in kill curve assays. Surviving cells are cul- 
tured in hygromycin to select for cells bearing integrated 
vectors and to dilute out episomal copies of the vector 
(s). At this point transcription is induced, and a single 

io stranded RNA (ssRNA) species derived from the insert 
sequence is transcribed in the nucleus from the nuclear 
promoter in the inserted vector. The insert is designed 
such that the insert sequences in the transcript are 
flanked by bidirectional promoters of RNA bacteri- 

15 ©phages (for example, Qbeta or MS2, RNA dependent 
RNA polymerase promoters) or cytoplasmic viral RNA- 
dependent RNA polymerase promoter sequences (for 
example, those of Sindbis or VEEV subgenomic pro- 
moters). The nuclear transcript is translocated to the cy- 

20 toplasm where it acts as a template for dsRNA by an 
RNA dependent RNA polymerase, which may be pro- 
vided through co-transfection of a vector that encodes 
an RNA-dependent RNA polymerase. Alternatively, an 
integrated copy of the polymerase may be used. 

25 

Example 2: Generation of templates for in vitro 
transcription of dsRNA for non-library based 
approaches for identification of nucleic acids using 
PTGS ^ 

30 

[0120] Nucleic acid sequences that modulate cell 
function, gene expression in a cell, or the biological ac- 
tivity of a target polypeptide in a cell may also be iden- 
tified using non-library based approaches involving 

35 PTGS. A single known nucleic acid sequence encoding 
a polypeptide with unknown function or a single nucleic 
acid fragment of unknown sequence and/or function can 
be made into a double stranded RNA molecule. This 
dsRNA is then transfected into a desired cell type and 

*o assayed for modulations in cell function, gene expres- 
sion in the cell, or the biological activity of a target 
polypeptide in the cell, using methods described herein. 
A modulation in cell function, gene expression in the cell, 
or the biological activity of a target polypeptide in the 

45 cell Identifies the nucleic acid of the dsRNA as a nucleic 
acid the modulates the specific cell function, gene ex- 
pression, or the biological activity of a target polypep- 
tide. This non-library based approach to nucleic acid 
identification is desirably used under conditions that in- 

50 hibit an interferon response or dsRNA stress response. 
Such conditions are described in detail below. 
[0121] Nucleic acid fragments generated, for exam- 
ple, by PCR or restriction endo nuclease digestion, en- 
coding the respective target sequences were used as 

55 templates for in vitro transcription reactions. PCR frag- 
ments are superior to plasmid templates for the synthe- 
sis of discrete sized RNA molecules. The PCR frag- 
ments encoded at least 20-50 or 100 to 1 000, for exam- 
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pie, 500 to 600 nucleotides (nts) of the target sequence 
and were derived from the target mRNA. Known target 
sequences were obtained from GenBank and or other 
DNA sequence databases. Target sequences were also 
obtained from cellular RNAs that were generated into 
cDNAs to create a number of different dsRNA mole- 
cules. Accordingly, it is possible that the sequence and/ 
or function of the target sequence was not known at the 
time the dsRNA was generated. 
[01 22] Templates for sense target RNAs were gener- 
ated by placing the bacteriophage T7 promoter at the 5' 
end of the target coding strand while antisense RNA 
templates contained the T7 promoter at the 5' end of the 
non-coding strand. This was achieved by encoding the 
T7 promoter at the 5' ends of the respective PCR prim- 
ers. Alternatively SP6 promoters, or a combination of 
SP6 and T7 promoters may be used. 
[01 23] PCR was performed by conventional methods. 
The use of both PCR templates in equimolar amounts 
in an in vitro transcription reaction resulted in primarily 
dsRNA. The use of two separate fragments has been 
found to be superior to the use of one PCR fragment 
containing two T7 promoters, one located at each end 
of the target sequence, presumably due to transcription 
interference that occurs during transcription of the dual 
promoter template. Following PCR amplification, the 
DNA was subjected to Proteinase K digestion and phe- 
nol -chloroform extraction to remove contaminating 
RNases. Following ethanol precipitation, the DNA was 
resuspended in RNase-free water at a concentration of 
1 to 3 u.g/uJ. 

[01 24] As an alternative to phenol-chloroform extrac- 
tion, DNA can be purified in the absence of phenol using 
standard methods such as those described by Li et at. 
(WO 00/44914, filed January 28, 2000). Alternatively, 
DNA that is extracted with phenol and/or chloroform can 
be purified to reduce or eliminate the amount of phenol 
and/or chloroform. For example, standard column chro- 
matography can be used to purify the DNA (WO 
00/44914, filed January 28, 2000). 

Example 3: In vitro RNA transcription and RNA analysis 

[0125] in vitro transcription reactions are carried out 
using the Riboprobe Kit (Promega Corp.), according to 
the manufacturer's directions. The template DNA is as 
described above. Following synthesis, the RNA is treat- 
ed with RQ1 DNase (Promega Corp.) to remove tem- 
plate DNA. The RNA is then treated with Proteinase K 
and extracted with phenol-chloroform to remove con- 
taminating RNases. The RNA is ethanol precipitated, 
washed with 70% ethanol, and resuspended in RNase- 
free water. Aliquots of RNA are removed for analysis 
and the RNA solution is flash frozen by incubating in an 
ethanol-dry ice bath. The RNA is stored at -80°C. 
[01 26] As an alternative to phenol-chloroform extrac- 
tion, RNA can be purified in the absence of phenol using 
standard methods such as those described by Li et al 



(WO 00/44914, filed January 28, 2000). Alternatively, 
RNA that is extracted with phenol and/or chloroform can 
be purified to reduce or eliminate the amount of phenol 
and/or chloroform. For example, standard column chro- 

5 matography can be used to purify the RNA (WO 
00/44914, filed January 28, 2000). 
[0127] dsRNA is made by combining equimolar 
amounts of PCR fragments encoding antisense RNA 
and sense RNA, as described above, in the transcription 

io reaction. Single stranded antisense or sense RNA is 
made by using a single species of PCR fragment in the 
reaction. The RNA concentration is determined by spec- 
trophotometry analysis, and RNA quality is assessed 
by denaturing gel electrophoresis and by digestion with 

*5 RNase T1 , which degrades single stranded RNA. 
[01 28] An mRNA library is produced using Qbeta bac- 
teriophage, by ligating the mRNAs to the flank sequenc- 
es that are required for Qbeta replicase function (Qbeta 
flank or Qbeta flank plus P1), using RNA ligase. The 

20 Ijgated RNAs are then transformed into bacteria that ex- 
press Qbeta replicase and the coat protein. Single 
plaques are then inoculated into fresh bacteria. All 
plaques are expected to carry transgene sequences. 
Each plaque is grown in larger quantities in bacteria that 

25 produce the Qbeta polymerase, and RNA is isolated 
from the bacteriophage particles. Alternatively, if the 
Qbeta flank plus P1 is used to generate the library (a 
g. t P1=MS2, VEEV ( or Sindbis promoter sequences), 
these vectors can be used to carry out the in vitro tran- 

30 scription along with the cognate polymerase. The in vitro 
made dsRNA is then used to transfect cells. 

RNA delivery 

35 [0129] In vitro made dsRNA is directly added to the 
cell culture medium at concentrations ranging from 50 
u.g/ml to 500 u.g/ml. Uptake of dsRNA is also facilitated 
by electroporation using those conditions required for 
DNA uptake by the desired cell type. RNA uptake is also 

^0 mediated by lipofection using any of a variey of com- 
mercially available and proprietary cationic lipids, DE- 
AE-dextran-mediated transfection, microinjection, pro- 
toplast fusion, calcium phosphate precipitation, viral or 
retroviral delivery, or biolistic transformation. The RNA 

45 is naked RNA or a local anesthetic RNA complex. Mod- 
ulation of cell function, gene expression, or polypeptide 
biological activity is then assessed in the transfected 
cells. 

[0130] Some dsRNA sequences, possibly in certain 
50 cell types and through certain delivery methods, may re- 
sult In an interferon response. During the screening 
methods of the present invention, induction of an inter- 
feron response is not desired, as this would lead to cell 
death and possibly to the prevention of gene silencing. 
55 [0131] One of the components of an interferon re- 
sponse is the induction of the interferon-induced protein 
kinase PKR. Suppression of the interferon response 
and/or the PKR response is desired in the target cells. 
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The dsRNA delivery methods described herein are per- 
formed such that an interferon response or dsRNA 
stress response is not included. It is recognized, how- 
ever, that certain conditions might present with an in- 
duction of the interferon response. To prevent such a 5 
response, a number of other strategies may be em- 
ployed with any of the above described screening meth- 
ods to identify a nucleic acid that modulates cell func- 
tion, gene expression, or the polypeptide biological ac- 
tivity of a cell, as described herein. 10 
[0132] To prevent an interferon response, interferon 
and PKR responses are silenced in the target cells using 
a dsRNA species directed against the mRNAs that en- 
code proteins involved in the response. Desirably inter- 
feron response promoters are silenced using dsRNA. *5 
Alternatively, the expression of proteins that bind the in- 
terferon response element is abolished using dsRNA 
techniques. 

[0133] In an alternative strategy, interferon and PKR 
knockout cell lines are created through approaches uti- 20 
lizing expression cassettes that encode an antisense 
RNA and ribozymes directed to the cellular mRNAs that 
encode the proteins involved in the response. Knockout 
cells are created by standard gene knockout technolo- 
gies using homologous recombination to alter target se- 25 
quences, using homologous DNA alone, or as complex- 
es of RecA protein and single stranded DNA homolo- 
gous to the target sequence(s). Interferon response el- 
ement (IRE) sequences, sequences that encode tran- 
scription factors that bind IRE sequences, the promoter 30 
and/or gene sequences that encode proteins in the PKR 
and interferon response pathways are molecules that 
are targeted for knockout. 

[0134] If desired, proteins involved in gene silencing 
such as Dicer or Argonaut can be overexpressed or ac- 35 
tivated to increase the amount of inhibition of gene ex- 
pression (Beach et a/., WO 01/68836, filed March 16, 
2001). 

Example 4: Cytoplasmic transcription vectors for non- 40 
library based approaches to nucleic acid identification 
using PTGS 

[0135] Double stranded RNA molecules for use in 
non-library based methods for the identification of nu- <s 
cleic acids that modulate cell function, gene expression 
of a target nuclec acid, or target polypeptide biological 
activity in a cell can also be generated through the use 
of cytoplasmic transcription vectors. Such vectors are 
generated as now described. so 
[0136] The PCR fragments generated for in vitro tran- 
scription templates, as described above, are inserted In- 
to a cloning vector containing one T7 promoter located 
just outside the polylinker region. Such a vector is pZE- 
RO blunt (Promega Corp.). The PCR fragment is cloned 55 
into a restriction site in the polylinker in such a way that 
the fragment's T7 promoter is distal to the vector's pro- 
moter. The resulting vector contains the target se- 



quence flanked by two T7 promoters; transcription from 
this vector occurs in converging directions. Convergent 
transcription is desired for these intracellular vectors, 
due to the uncertainty of getting sense and antisense 
vectors into the same cell in high enough and roughly 
equivalent amounts. In addition, the local concentration 
of antisense and sense RN As with respect to each other 
is high enough to enable dsRNA formation when the du- 
al promoter construct is used. 

[01 37] A hygromycin resistance cassette is cloned in- 
to the pZERO blunt vector as well. The hygromycin re- 
sistance cassette contains the hygromycin resistance 
gene under the control of the Herpes Simplex Virus 
(HSV) thymidine kinase promoter and the SV40 polya- 
denlyation signal. The cassette is in a plasmid vector 
and is flanked at both ends by a polylinker region ena- 
bling ease of removal and subsequent cloning. Hygro- 
mycin selection was chosen because of the rapidity of 
death induced by hygromycin as well as extensive in- 
house experience with hygromycin selection. Alterna- 
tively, other selection methods known to those skilled in 
the art may be used. 

[0138] The vectors are transfected into the desired 
cells using standard transformation or transfection tech- 
niques described herein, and the cells are assayed for 
the ability of the dsRNA molecules encoded by the vec- 
tors to modulate cell function, gene expression of a tar- 
get nuclec acid, or the biological activity of a target 
polypeptide, as described herein. 

Example 5: Analysis of RNA from transfected cells 

[01 39] Regardless of whether a library based screen- 
ing approach or a non-library based approach was used 
to identify nucleic acid sequences, in order to measure 
the level of dsRNA effector molecule within the cell, as 
well as the amount of target mRNA within the cell, a two- 
step reverse transcription PCR reaction was performed 
with the ABI PRISM™ 7700 Sequence Detection Sys- 
tem. Total RNA was extracted from cells transfected with 
dsRNA or a plasmid from a dsRNA expression library 
using Trizol and DNase. Two to three different cDNA 
synthesis reactions were performed per sample; one for 
human GAPDH (a housekeeping gene that should be 
unaffected by the effector dsRNA), one for the target 
mRNA, and/or one for the sense strand of the expected 
ds RNA molecu le (effector molecu le) . Prior to cDN A syn- 
thesis of dsRNA sense strands, the RNA sample was 
treated with T1 RNase. The cDNA reactions were per- 
formed in separate tubes using 200 ng of total RNA and 
primers specific for the relevant RNAs. The cDNA prod- 
ucts of these reactions were used as templates for sub- 
sequent PCR reactions to amplify GAPDH, the target 
cDNA, and/or the sense strand copied from the dsRNA. 
All RNA was quantified relative to the internal control, 
GAPDH. 
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Example 6: Target sequence identification 

[0140] To identify the target sequence affected by a 
dsRNA, using any of the above- described methods, 
DNA is extracted from expanded cell lines (or from the 
transfected cells if using a non-integrating dsRNA sys- 
tem) according to methods well known to the skilled ar- 
tisan. The dsRNA encoding sequence of each integrant 
(or non-integrated dsRNA molecule if using a non-li- 
brary based method) is amplified by PCR using primers 
containing the sequence mapping to the top strand of 
the T7 promoter (or any other promoter used to express 
the dsRNA). Amplified DNA is then cloned into a cloning 
vector, such as pZERO blunt (Promega Corp.), and then 
sequenced. Sequences are compared to sequences in 
GenBank and/or other DNA databases to look for se- 
quence identity or homology using standard computer 
programs. If the target mRNA remains unknown, the 
mRNA is cloned from the target cell line using primers 
derived from the cloned dsRNA by established tech- 
niques (Sambrook et a/., supra). Target validation is then 
carried out as described herein. 
[01 41] In the stably integrated dsRNA expression sys- 
tem described above, despite efforts to reduce negative 
position effects, inefficient dsRNA synthesis by PCR 
methods may occur. This can be circumvented by res- 
cuing the integrated cDNA or randomized nucleic se- 
quences into replicating plasmids. Rescued plasmids 
are amenable to amplification in bacteria and to se- 
quencing. Rescue is achieved by re-transfecting the 
population of cells transfected with the dsRNA expres- 
sion library with the rescue plasmid and a plasmid en- 
coding Cre recombinase. The rescue plasmid carries a 
bacterial origin of replication, a bacterial antibiotic se- 
lection marker, an SV40 origin of replication, and an 
SV40 T antigen expression cassette, as well as loxP 
sites positioned as an inverted repeat to allow Cre-me- 
diated double recombination. The SV40-based origin of 
replication in the rescue plasmid allows amplification of 
rescued sequences in the integrated cells. Following 
rescue, higher levels of transcription are anticipated, 
thereby favoring dsRNA formation. The cells are then 
screened for modulations in cell function, target nucleic 
acid expression, or target polypeptide biological activity 
changes as described herein. 

Example 7: Prevention of an interferon response during 
gene silencing 

[0142] As discussed above, during the above-de- 
scribed screening methods, induction of an interferon 
response is not desired, as this would lead to cell death, 
anti -proliferative responses, and possibly to prevention 
of gene silencing. One of the components of an interfer- 
on response is the induction of the interferon-induced 
protein kinase PKR. Suppression of the interferon re- 
sponse and/or the PKR response is desired in the target 
cells. The dsRNA delivery methods described herein 



are performed such that an interferon response is not 
included. It is recognized, however, that certain condi- 
tions might present with an induction of the interferon 
response. To prevent such a response, a number of oth* 
5 er strategies may be employed with any of the above 
described screening methods to identify a nucleic acid 
that modulates cell function, gene expression, or the 
polypeptide biological activity of a cell, as described 
herein. 

io [0143] To prevent an interferon response in a system 
involving stable integration of the nucleic acid containing 
the dsRNA expression cassette, the vectors used to 
generate either the loxP integrant or the vector that en- 
codes the dsRNA expression cassette are designed to 

15 contain sequences that encode proteins that block the 
PKR response, such as the Vaccinia vims protein E3 
(Romano et a!. t Molecular and Cellular Biology 18: 
7304-7316, 1998; Accession No. M36339), or a cellular 
protein P58 IPK , which the influenza virus mobilizes to 

20 block PKR (Gale et aL, Microbiology and Molecular Bi- 
ology Reviews 64:239-280, 2000; Accession No. 
XM_032882). Several other viral proteins have also 
been identified {e.g., Hepatitis C E2; Accession No. 
S72725) and may be similarly used. These proteins can 

25 be expressed in the desired cell types or in animals 
through the use of any of a number of commercially 
available mammalian expression vectors or vertebrate 
expression vectors. Such vectors can be obtained from 
a number of different manufacturers including Invitrogen 

30 (Carlsbad, CA) Promega ((Madison, Wl), or Clontech 
(Palo Alto, CA). An example of such a vector is the pCI- 
neo Mammalian Expression Vector from Promega. 
[0144] Regardless of whether nucleic acid encoding 
a dsRNA is stably integrated into a chromosome or is 

35 not integrated into a chromosome, the following meth- 
ods may be used to prevent an interference response 
in any of the screening methods of the present invention. 
In one example of an interferon avoidance strategy, in- 
terferon and PKR responses are silenced in the target 

40 cells using a dsRNA species directed against the mR- 
NAs that encode proteins involved in the response. De- 
sirably interferon response promoters are silenced us- 
ing dsRNA. Alternatively, the expression of proteins that 
bind the interferon response element is abolished using 

45 dsRNA techniques. 

[0145] In an alternative strategy, interferon and PKR 
knockout cell lines are created through approaches uti- 
lizing expression cassettes that encode an antisense 
RNA and ribozymes directed to the cellular mRN As that 

so encode the proteins involved in the response. Knockout 
cells are created by standard gene knockout technolo- 
gies using homologous recombination to after target se- 
quences, using homologous DNA alone, or as complex- 
es of RecA protein and single stranded DNA homolo- 

55 gous to the target sequence(s). Interferon response el- 
ement (IRE) sequences, sequences that encode tran- 
scription factors that bind IRE sequences, the promoter 
and/or gene sequences that encode proteins in the PKR 
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and interferon response pathways are molecules that 
are targeted for knockout. 

[0146] In yet another alternative, chimeric oligonucle- 
otides may be used to alter target sequences. Methods 
for inhibiting expression of polypeptides through chimer- 5 
ic oligonucleotides are well known in the art (Igoucheva 
and Yoon, Human Gene Therapy 1 1 :2307-231 2, 2000). 

Example 8: Functional screening for cell invasion 

10 

[0147] Cell invasion is one cell function that may be 
evaluated in the search for novel genes that are modu- 
lated using the methods described herein. Matrigel, a 
biological extracellular matrix, has properties similar to 
that of a reconstituted basement membrane and has '5 
been used to measure the invasive potential of tumor 
cells (Platet and Garcia, supra). Cells transfected with 
randomized or cDNA libraries that have been cloned in- 
to PTGS vectors are monitored for their capacity to in- 
vade matrigel invasion chambers. Cells that have taken 20 
up sequences unrelated to invasion invade the matrigel 
as efficiently as vector-transfected control cells. Cells 
experiencing PTGS of genes that are involved in cell in- 
vasion invade much less efficiently. If the dsRNA ex- 
pression cassette is stably integrated in a chromosome, 25 
these cells are retrieved and second and third rounds of 
selection are carried out in order to isolate specific nu- 
cleic acid sequences relevant to cell invasion. The effect 
of these sequences on invasion is ultimately confirmed 
by their ability to block the formation of tumors in animal 30 
models. 

[0148] Several human cell lines, for example, 
MDA-MB-231, used by Platet and Garcia (supra), 
,SKBr3, and MCF-7ADR, a more metastatic variant of 
MCF-7. MDA-MB-231 breast cancer cells (obtained 35 
from the American Type Culture Collection) are also 
transfected with cDNA libraries or randomized nucleic 
acid libraries constructed into the vectors described 
above. Desirably all cells in this assay contain a single 
copy of a transfected gene, as described above. 40 
[0149] Cells cultured in commercially available 24- or 
96-well formatted systems are used to carry out the cell 
invasion assay. As this screening protocol relies upon 
repeated rounds of selection, it may be desirable to keep 
the cell numbers in each well low enough that enrich- 45 
ment Is seen In each succeeding round, yet high enough 
to recover sufficient cells to culture within a reasonable 
time period. Therefore, culture conditions that result in 
invasion by greater than 50% of the cells and that still 
permit recovery from the surface of the matrigel are so 
made optimal. Non-invasive (NIH3T3 cells) or poorly in- 
vasive (MCF7) cell lines are analyzed in parallel as neg- 
ative controls for invasion. 

[0150] Initially, triplicate cultures of half-log order di- 
lutions from 102 to 10 6 cells per well are plated. Cells 55 
are then recovered by "scrubbing" with a sterile cotton 
swab in fresh culture media and are seeded into 96-well 
plates. The number of invasive cells in the matrigel is 



quantified using either an MTT-based assay (Sasaki 
and Passaniti, Biotechniques 24:1038-1043, 1998) or a 
fluorescent indicator (Gohla et a/., Clin. Exp. Metastasis 
14:451-458, 1996). 

[01 51 ] Once the appropriate cell densities for the as- 
say have been empirically determined, stable transfect- 
ed cells are plated in the matrigel cell invasion cham- 
bers. Each experiment includes the following controls: 
a sample of untransfected cells as a reference culture; 
untransfected cells treated with anti-invasive chemo- 
therapeutic agents, such as taxol or doxorubicin, as a 
positive control for inhibition of invasion; cells transfect- 
ed with empty vectors to confirm that the vector alone 
had no effects on invasion; and cells transfected cells 
with genes that are known to block invasion in this as- 
say, such as estrogen receptor-a or TIMP-2 (Kohn et 
a!., Cancer Research 55:1856-1862, 1995; and Wood- 
house et a/., Cancer (Supplement) 80:1529-1536, 
1997). 

[0152] Cells that fail to invade the matrigel are re- 
moved from each well to the corresponding wells of a 
96-well plate and cultured until macroscopic colonies 
are visible. It is important to collect cells at more than 
one time point after plating, since the time it takes for 
PTGS to be effective may vary, and it may be that dif- 
ferent genes are active at different times after plating. 
Once the cells are transferred to 96-well plates, they are 
diluted out and taken through successive rounds of re- 
screening in the invasion assay in order to expand and 
isolate cell lines with altered invasive ability. As the pop- 
ulation becomes more and more enriched for cells with 
a non-invasive phenotype, the reduction in invasive 
cells in the matrigel can be better quantified via MTT or 
fluorescence assays. Ultimately, a large panel of cloned 
double-stable cell lines is generated. 
[0153] This assay can also be carried out with cells 
into which a dsRNA is not stably integrated into a chro- 
mosome. The assay is conducted essentially as de- 
scribed above except that multiple rounds of selection 
and re-screening are not necessary since the cell is 
transfected with only one dsRNA species. Thus, the tar- 
gets) of the PTGS event is readily identifiable using the 
cloning and sequencing techniques described above. 

Example 9: Downregulation of HIV using HIV-derived 
dsRNA and inhibitors of the interferon response 
pathway 

[0154] During the course of HIV infection, the viral ge- 
nome is reverse transcribed into a DNA template that is 
integrated into the host chromosome of infected dividing 
cells. The integrated copy is now a blueprint from which 
more HIV particles are made. Several cell lines that con- 
tain integrated copies of a defective HIV genome, 
HIVgpt (strain HXB2) have been created. The HIVgpt 
genome contains a deletion of the HIV envelope gene; 
all other HIV proteins are encoded. The plasmid used 
to create these cell lines, HIVgpt was obtained from the 
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AIDS Research and Reference Reagent Program Cat- 
alog. Stably integrated cell lines were made with human 
rhabdomyosarcoma (RD) cells. The lines were made by 
transfecting cells with the plasmid followed by selection 
of cells in mycophenolic acid. The HIVgpt genome en- s 
codes a mycophenolic acid (MPA) resistance gene in 
place of the envelope gene and thereby confers resist- 
ance to MPA. Cells resistant to MPA were clonally am- 
plified. The media from the cultured clonally expanded 
cells was assayed for the presence of p24 (an HIV gag w 
polypeptide that is secreted extracellularly). All cell lines 
were positive for p24, as assessed using a p24 ELISA 
assay kit (Coulter, Fullerton, CA). The cell lines also 
make non-infectious particles which can be rescued into 
infectious particles by co-expression of an HIV envelope is 
protein. 

[0155] The HIVgpt cell lines are used as a model sys- 
tem with which to downregulate HIV expression via 
PTGS. Plasmids encoding a 600 nt sense RNA, a 600 
nt antisense RNA, or a 600 bp double stranded RNA 20 
(dsRNA), mapping to the same coordinates of the gag 
gene of HIV strain HXB2 are used to transfect cells (the 
map from which the coordinates are based is found at 
GenBank Accession numberK03455, HIV(HXB2), com- 
plete genome, and the gag RN As used in this study map 25 
to coordinates 901-1500). Expression of the RNAs is 
from T7 RNA polymerase promoter(s) located at the 5' 
end of the gag sense strand, at the 5* end of the anti- 
sense strand, or at converging positions at the 5* ends 
of both the sense and anti-sense strands, respectively. 30 
These encoded RNAs are not designed to be able to 
make protein (i.e., they do not have a cap, a poly A tail, 
orthe native initiation codon). Transcription of the RNAs 
is catalyzed by T7 RNA polymerase, provided from a 
second co-transfected T7 RNA polymerase expression 35 
plasmid. Control plasmids expressing a similar sized 
sense RNA, antisense RNA, and dsRNA derived from 
the gD gene of an HSV2 genome are included as ex- 
perimental controls (the map from wh ich the coordinates 
are based is found at GenBank Accession number 40 
K01408, HSVgD2 gene, and the HSVgD RNAs used in 
this study map to coordinates 313-872). 
[01 56] Cells used in these studies are transfected with 
an expression plasmid encoding a gene product known 
to Interfere with the dsRNA Induced Interferon response 
or with the PKR response, as described above. The cells 
are transfected with lipofectamine (Gibco-BRL) as a 
transfecting agent according to the manufacturer's in- 
structions. 

[0157] Two days after transfection, the cells are har- so 
vested and seeded into six-well plates and cultured to 
approximately 80 to 90% confluence. Cells are co-trans- 
fected with the T7 RNA polymerase expression plasmid 
and one of the RNA expression plasmids, such that one 
well of cells receives the T7 RNA polymerase expres- 55 
sion plasmid and the gag sense RNA expression plas- 
mid; one well of cells receives the T7 RNA polymerase 
expression plasmid and the gag antisense RNA expres- 



sion plasmid; one well of cells receives the T7 RNA 
polymerase expression plasmid and the gag dsRNA ex- 
pression plasmid; one well of cells receives the T7 RNA 
polymerase expression plasmid and the HSVgd sense 
RNA expression plasmid; one well of cells receives the 
T7 RNA polymerase expression plasmid and the HSVgd 
antisense RNA expression plasmid; and one well of 
cells receives the T7 RNA polymerase expression plas- 
mid and the HSVgD dsRNA expression plasmid. Trans- 
fection is again mediated through lipofectamine (Gib- 
co-BRL). There also is a control group of cells receiving 
no RNA. The cells are monitored for p24 synthesis over 
the course of several weeks. The cells are assayed both 
by measuring p24 in the media of cells (using the p24 
ELISA kit from Coulter, according to the manufacturer's 
instructions) and by immunostaining fixed cells for p24 
using a rabbit polyclonal anti-p24 sera and anti-rabbit 
IgG that is FITC conjugated (Sigma). None of the gD 
RNAs specifically shut down p24 synthesis. The double 
stranded gag RNA significantly down regulates p24. 
The sense and antisense have only a modest effect on 
p24 synthesis and some of the effect is predicted to be 
through the ability of the sense and antisense gag RNAs 
to generate low levels of dsRNA species. 

Example 10: Downregulation of PSA expression in 
human Rhabdomyosarcoma cells using intracellular 
expression of dsRNA 

[0158] RD cells transiently expressing prostate spe- 
cific antigen (PSA) were transfected with a T7 RNA 
polymerase expression vector and T7 RNA expression 
vectors expressing PSA dsRNA, PSA sense RNA, PSA 
antisense RNA, or control RNAs. The ability of the ex- 
pressed RNAs to downregulate PSA expression was 
assessed, as described further below. 

Creation of a transient PSA expression line 

[0159] The ability to downregulate expression of PSA 
following the expression of a PSA specific double- 
stranded RNA (dsRNA) was demonstrated in a human 
rhabdomyosarcoma cell line. Since available PSA cell 
lines are difficult to work with (i.e., they are hard to trans- 
fect, and the cells tend to clump), a human cell line tran- 
siently expressing PSA was created. To create these 
cells, human rhabdomyosarcoma cells were transiently 
transfected with a PSA plasmid-based expression vec- 
tor, under conditions that result in transfection of greater 
than 95% of the cells. Transfection was mediated with 
lipofectamine transfecting reagent (Gibco-BRL) accord- 
ing to the manufacturer's instructions. Expression of 
PSA was directed by the HCMV-IE promoter and the 
SV40 poryadenylation signal (Fig. 2). PSA expression 
was measured in the supernatant of transfected cells 
using a PSA ELISA kit (Oncogene Science Diagnostics, 
Cambridge, MA). No PSA was detected in the untrans- 
fected parental cells while PSA was abundantly ex- 
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pressed in cells receiving the PSA expression vector. 

Downregulation of PSA expression 

[0160] The PSA expressing cell line was used as a 
model system with which to demonstrate the downreg- 
ulation of PSA protein levels by PTGS. In these studies, 
plasmids encoding an approximately 600 nt sense RNA, 
a 600 nt antisense RNA or a 600 nt dsRNA derived from 
a PSA cDNA were used to transfect the PSA expressing 
cell line (Fig. 2). Expression of the RNAs was from a T7 
RNA polymerase promoter(s) located at the 5* end of 
the PSA sense strand, at the 5' end of the PSA antisense 
strand, or at converging positions at the 5' ends of both 
the sense and antisense strands respectively (Fig. 2). 
These encoded RNAs are not designed to be able to 
make protein (they do not have a cap, or a poly A tail). 
Transcription of the RNAs was catalyzed by T7 RNA 
polymerase, provided by a co-transfected T7 RNA ex- 
pression plasmid. Control plasmids expressing similar 
sized sense RNA, antisense RNA, and dsRNA derived 
from the glycoprotein D (gD) gene of an Herpes simplex 
2 (HSV-2) genome, as described above were included 
as experimental controls. 

[0161] Cells used in these studies can optionally be 
transfected with an expression plasmid encoding a gene 
product known to interfere with the dsRNA induced in- 
terferon response or with the PKR response, as de- 
scribed above. The cells are transfected with lipo- 
fectamine (Gibco-BRL) as a transfecting agent accord- 
ing to the manufacturer's instructions. 
[0162] Human rhabdomyosarcoma cells were seed- 
ed into six-well plates and cultured to approximately 80 
to 90% confluence. The cells were co-transfected with 
(A) the PSA expression plasmid; (B) one of the T7 RNA 
expression plasmids; and (C) the T7 RNA polymerase 
expression plasmid, such that all PSA expressing cells 
were transfected with the T7 RNA polymerase expres- 
sion plasmid and one of the following: the T7 sense PSA 
RNA expression construct, the T7 antisense PSA RNA 
expression construct, the T7 dsRNA PSA expression 
construct, the sense HSVgD RNA expression construct, 
the antisense HSVgD expression construct, or the dsR- 
NA HSVgd expression construct. Cells received identi- 
cal amounts of the PSA expression plasmid and the T7 
RNA expression plasmid. The amounts of the T7 RNA 
expression plasmids were also constant amongst the 
transfected cells. Total DNA per transfection was held 
constant at 2.5 u,g DNA per one well of a six-well plate. 
In those transfections where there was no T7 RNA ex- 
pression plasmid, an inert plasmid DNA was used as 
filler DNA. Transfection was mediated by lipofectamine 
(Gibco-BRL) according to the manufacturer's instruc- 
tions. There was also a control group of untransfected 
cells, as well as an untreated PSA control group of cells 
transfected with only the PSA expression plasmid in 
combination with the T7 RNA polymerase expression 
plasmid. 



[0163] PSA-expressing cells that were not transfect- 
ed with the T7 RNA expression plasmid, as well as cells 
transfected with the T7 HSV2-gDRNA expression plas- 
mid all expressed PSA abundantly and at comparable 
5 levels. Cells transfected with the sense, antisense : and 
ds PSA RNA expression plasmids all exhibited varying 
degrees of inhibition of PSA expression. A 5%-10% re- 
duction in expression was seen in cells expressing the 
PSA sense RNA, a 50% reduction was seen in cells ex- 
'0 pressing the PSA antisense RNA and greater than 95% 
reduction was seen in the cells expressing the PSA dsR- 
NA (Fig. 3). The inhibition was seen within two days after 
transfection and continued up until the last time point 
taken (one month later) at which point PSA levels were 
'5 beginning to decline in the untreated cells and the ex- 
periment was terminated. The untreated PSA controls 
as well as cells transfected with the T7 HSV2-gD control 
RNA expression plasmids all expressed PSA abundant- 
ly and at comparable levels (Fig. 3), indicating the spe- 
cificity of dsRNA effectors to silence gene expression. 
During the one of month culture, cells were expanded 
into larger cultures at routine intervals. 
[0164] Although the PSA specific ds RNA induced sig- 
nificant inhibition of PSA expression, antisense and 
sense PSA RNAs also induced some level of inhibition. 
Antisense PSA RNA has the potential to form dsRNA 
by annealing with PSA mRNA. Therefore the inhibition 
seen with antisense RNA may be explained by both an 
antisense mechanism and a dsRNA induced inhibition. 
A critical intracellular concentration of both antisense 
RNA and mRNA is required to generate dsRNA. Since 
much less dsRNA is made in the antisense RNA ex- 
pressing cells relative to those cells designed to make 
dsRNA, a lesser inhibition of PSA in the antisense RNA 
expressing cells is expected if the threshold dsRNA lev- 
els required for efficient silencing have not been 
reached in those cells. We have also demonstrated that 
a small amount of antisense RNA can be found in cells 
transfected with our expression vectors (approximately 
0.2% the amount of mRNA steady state levels). Anti- 
sense expression is presumably driven by cryptic pro- 
moters on the non-coding plasmid DNA strand. The ob- 
served sense RNA inhibition could therefore also in- 
volve a dsRNA molecule. RNA from transfected but un- 
treated ceils could also be analyzed to determine If the 
low level expression of antisense RNAs in these cells 
results in the production of detectable dsRNA species. 
Some low level expression of PSA occurred in cells ex- 
pressing PSA dsRNA. It is likely that some percentage 
of cells did not take up the dsRN A expression cassette 
or that the threshold levels of dsRNA were not reached 
in some cells. No cellular toxicity was seen with any of 
the dsRNAs generated by the RNA expression vectors 
suggesting that cytoplasmic expression of dsRNA does 
not induce the interferon response. In contrast, cell 
death is induced when certain concentrations of in vitro 
produced dsRNA is delivered to cells via transfection 
with certain cationic lipids. 
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[01 65] In summary, these results indicate that (i) PSA 
derived dsRNA is much more efficient than PSA anti- 
sense RNA in down-regulating PSA expression, (ii) the 
down-regulation is sequence specific; only the PSA de- 
rived dsRNA and not the control HSV-2 derived dsRNA s 
induced down-regulation of PSA, and (iii) there is no tox- 
icity associated with the cytoplasmic expression of long 
(600 bp) dsRNA molecules. Additionally, these experi- 
ments are the first demonstration of dsRNA mediated 
down-regulation of gene expression in a human cell line. 10 

Example 11: Intracellular expression of dsRNA does not 
induce the type 1 interferon response (RNA stress 
response). 

15 

[0166] Human rhabdomyosarcoma (RD) cells were 
transfected with various dsRNA expression vectors 
such that dsRNA was transcribed in the transfected cells 
as described in Example 1 0. Transcription of dsRNA oc- 
curred in the cells within 24 hours after transfection and 20 
continued for the duration of the thirty day experiment. 
Cells and the supematants from transfected cells were 
analyzed during the course of the experiment for any 
evidence of RNA stress response induction. No evi- 
dence of RNA stress response induction by intracellular 25 
expressed dsRNA was observed. RD cells have been 
shown by us to be responsive to type 1 interferon, both 
alpha and beta, and thus RD cells are capable of mount- 
ing an RNA stress response. In addition, positive con- 
trols were included in these experiments. A positive con- 30 
trol for these experiments is a method of delivering dsR- 
NA which induces the RNA stress response. All positive 
controls induced the RNA stress response. These ex- 
periments are described further below. 

35 

Assays performed to identify RNA stress response 
induction 

[01 67] The following assays were performed to meas- 
ure the induction of an RNA stress response: TUNEL *o 
assay to detect apoptotic cells, ELISA assays to detect 
the induction of alpha, beta and gamma interferon, ri- 
bosomal RNA fragmentation analysis to detect activa- 
tion of 2'5'OAS, measurement of phosphorylated eiF2a 
as an Indicator of PKR (protein kinase RNA inducible) 
activation, proliferation assays to detect changes in cel- 
lular proliferation, and microscopic analysis of cells to 
identify cellular cytopathic effects. Apoptosis, interferon 
induction, 2'5* OAS activation, PKR activation, anti-pro- 
liferative responses, and cytopathic effects are all ind'n so 
cators for the RNA stress response pathway. 

Transfection of cells 

[01 68] Approximately 7x1 0 5 RD cells were seeded in- 55 
to individual wells of six-well plates. Cells were trans- 
fected when they reached about 90% confluency. Cells 
were transfected with aT7 RNA polymerase expression 



construct and a T7 dsRNA expression construct. The 
T7 dsRNA expression constructs encode converging T7 
promoters located on either side of a 600 bp sequence 
(Fig. 2). Controls included cells transfected with the T7 
RNA expression construct alone so that no dsRNA is 
made in these cells. Total DNA per transfection was held 
constant at 2.5 ug DNA per one well of a six-well plate. 
When the T7 RNA polymerase and T7 dsRNA expres- 
sion vectors were both used, 1 .25 ug of each DNA was 
used per transfection. In those transfections where 
there was no T7 dsRNA expression construct, inert filler 
DNA was used to bring the total DNA to 2.5 u.g per trans- 
fection. Transfection was mediated using Lipofectamine 
(InVitrogen) according to the manufacturer's directions. 
The positive control transfections included poly(l)(C) 
RNA and in vitro transcribed dsRNA of 600 bp that were 
both complexed with Lipofectamine and transfected into 
cells. The cells were transfected with in vitro transcribed 
ssRNA complexed with Lipofectamine. 2.5 u.g of each 
RNA was used per transfection. Othercontrols included 
untreated cells. Cells were kept in culture for one month 
by expanding into larger flasks as the cell numbers in- 
creased. 

ELISA assays 

[0169] Supematants were removed from the trans- 
fected and untreated cells at time points of 1 , 2, 7, 17, 
and 48 hours and every several days for up to one month 
after the 48 hour time point. Collected supematants 
were stored at -80°C until they were analyzed for the 
presence of alpha, beta, and gamma interferon using 
commercially available ELISA kits. The Interferon-alpha 
ELISA kit was obtained from ENDOGEN (Rockford, IL), 
the Interferon-Beta ELISA kit was obtained from RD1 
(Flanders, NJ), and the Interferon-gamma ELISA kit was 
obtained from R&D Systems (Minneapolois.MN). ELI- 
SAs were all performed according to the manufacturer's 
directions. Alpha, beta : and gamma interferon were not 
detected at increased levels in cells expressing intrac- 
ellular dsRNA compared to the corresponding levels in 
untreated cells. However, considerable levels of beta in- 
terferon were found in cells transfected with poly (l)(C) 
or with in vitro transcribed dsRNA and ssRNA. Alpha 
and beta interferon induction are associated with induc- 
tion of the RNA stress response. 

TUNEL assay 

[0170] Cells were stained for the presence of apop- 
totic nuclei using a commercially available kit, TdT 
FragEL, DNA Fragmentation Detection Kit, In Situ Ap- 
optosis Assay from Oncogene (Boston, MA). Cells were 
stained according the manufacturer's directions. Cells 
were stained at 2 hours, 7 hours, 17 hours, 2 days, 3 
days, 4 days, and 5 days after transfection. There was 
no evidence of apoptosis induced by intracellular ex- 
pressed dsRNA at any of the time points analyzed. How- 
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ever, the majority of cells transfected with poly (l)(C) or 
with the in vitro transcribed dsRNA were apoptotic by 
17 hours after transfection. No evidence of apoptosis 
was observed in the untreated cells or in cells transfect- 
ed with ssRNA. Apoptosis is an end result of the induc- 
tion of the RNA stress response pathway. 

2'5'OAS activation 

[0171] The activation of 2'5'OAS was determined by 
performing ribosomal RNA fragmentation analysis. 
Briefly, following transfection, total RNA was extracted 
from cells using standard procedures. RNA was extract- 
ed at the following time points: 2 hours, 7 hours, 17 
hours, 48 hours, 3 days, 4 days, and 5 days after trans- 
fection. 5-10 u,g RNA was analyzed for each sample. 
RNA samples were first denatured in formal dehyde/for- 
mamide RNA sample buffer at 65°C for 1 0 minutes prior 
to being electrophoresed through 0.5X TBE agarose 
gels. Ribosomal RNA was visualized by staining with 
ethidium bromide followed by ultraviolet transillumina- 
tion. Ribosomal RNA fragmentation was observed in 
cells transfected with poly (l)(C) and with the in vitro 
transcribed dsRNA. No fragmentation was observed in 
the untreated control cells, cells transfected with ssR- 
NA, or in cells expressing intracellular dsRNA. These 
results indicate that 2'5'OAS was not activated by dsR- 
NA when it was made intracellular^. 2'5'OAS activation 
is associated with induction /activation of the RNA stress 
response pathway. 

PKR activation 

[0172] The activation of PKR was determined by 
measuring the levels of elF2alpha phosphorylation. 
Briefly, cells were lysed at various times after transfec- 
tion (2 hours, 7 hours, 19 hours, 48 hours, 3 days, 4 
days, and 5 days after transfection) and analyzed for 
levels of phosphorylated and non-phosphorylated elF2 
alpha. The protocol for lysing cells can be found in the 
following reference: Zhang R et a/., J. Biol. Chem. 276 
(27):24946-58, 2001. This analysis was performed as 
described for detecting PKR phosphorylation except 
that antibodies specific for phosphorylated and non- 
phosphorylated eiF2alpha were used. These antibodies 
are available from Cell Signaling Technology ( Beverly, 
MA). 

Cytopathic effect and antiproliferative responses 

[0173] Cytopathic effect was assayed by anaryzing 
cells microscopically using a light microscope. Ceils 
were analyzed at daily intervals throughout the course 
of the experiment. Cytopathic effect is defined as any or 
all of the following: cells detaching from surface of well/ 
flask, cells rounding up, an increased number of vacu- 
oles in transfected cells with respect to the control un- 
treated cells, or differences in morphology of cells with 



respect to the untreated control cells. No cytopathic ef- 
fect was seen in those cells expressing dsRNA intrac- 
ellularty. Severe cytopathic effects were seen in cells 
transfected with Poly (l)(C) or with dsRNAmade in vitro. 
5 Cytopathic effect is associated with the RNA stress re- 
sponse. 

[0174] Antiproliferative responses were assayed by 
measuring the division rate of cells. The division rate is 
determined by counting cell numbers using standard 
10 procedures. Cells were counted every few days for the 
duration of the experiment. No antiproliferative respons- 
es were seen in cells expressing dsRNA intracellulary. 
Antiproliferative responses are associated with the RNA 
stress response. 

15 

Summary of results 

[0175] The results of the above assays indicate that 
intracellular expression of dsRNA does not induce the 

20 RNA stress response. The cells that were used for these 
experiments were competent for RNA stress response 
induction as was demonstrated by the ability of cationic 
lipid complexed poly(l)(C) and in vitro transcribed RNA 
to induce/activate all tested components of this re- 

^5 sponse. In addition, the cells were found to be respon- 
sive to exogenously added interferon. These results im- 
ply that the cells used for these experiments are not de- 
fective in their ability to mount an RNA stress response 
and therefore can be used as predictors for other cells, 

30 both in cell culture and in vivo in animal models. This 
method described here, which does not induce the in- 
terferon stress response, has been found to induce 
PTGS. This method therefore provides a method to in- 
duce PTGS without inducing an undesired RNA stress 

35 response. 

[0176] Although these results were generated using 
a vector that utilizes a T7 transcription system and there- 
fore expresses dsRNA in the cytoplasm, the vector sys- 
tem can be changed to other systems that express dsR- 

40 NA intracellular^. Similar results are expected with 
these expression systems. These systems include, but 
are not limited to, systems that express dsRNA or hair- 
pin RNAs in the nucleus, in the nucleus followed by 
transport of the RNAs to the cytoplasm, or in the cyto- 

45 plasm using non-T7 RNA polymerase based expression 
systems. 

Example 12: Optimization of the concentrations and 
relative ratios of in vitro or in vivo produced dsRNA and 
so delivery agent 

[0177] The optimal concentrations and ratios of dsR- 
NA to a delivery agent such as a cationic lipid, cationic 
surfactant, or local anesthetic can be readily determined 
55 to achieve low toxicity and to efficiently induce gene si- 
lencing using in vitro or in vivo produced dsRNA. 
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Summary of factors effecting nucleic acid/cationic lipid 
interactions 

[0178] Cationic lipid DNA interactions are electrostat- 
ic. Electrostatic interactions are highly influenced by the 
ionic components of the medium. The ability to form sta- 
ble complexes is also dependent upon the intermolecu- 
lar interactions between the lipid molecules. At low con- 
centrations, certain inter-lipid interactions are preferred; 
at higher lipid concentrations, rapid condensates are 
formed due to higher order interactions. Although local 
interactions are similar in both of these instances (e.g., 
phosphoryl groups in the DNA and the charged cationic 
head group), the long range and inter-lipid interactions 
are substantially different. Similarly, structurally diverse 
variants can be obtained simply by changing the charge 
ratio of the complex by mixing varying amounts of cati- 
onic lipid with fixed concentrations of the nucleic acid or 
vice versa. This variation in the structure of the complex- 
es is evidenced by altered physical properties of the 
complexes (e.g., differences in octanol partitioning, mo- 
bility on density gradients, charge density of the particle, 
particle size, and transferability of cells in culture and 
in vivo) (Pachuk etal DNA Vaccines - Challenges in De- 
livery, Current Opinion in Molecular Therapeutics, 2(2) 
188-ig8, 2000 and Pachuk et at., BBA, 1468, 20-30, 
(2000)). Furthermore, different lipids, local anesthetics, 
and surfactants differ in their interactions between them- 
selves, and therefore novel complexes can be formed 
with differing biophysical properties by using different li- 
pids singularly or in combination. For each cell type, the 
following titration can be carried out to determine the 
optimal ratio and concentrations that result in complex- 
es that do not induce the stress response or interferon 
response. At several of these concentrations PTGS is 
predicted to be induced; however, PTGS is most readily 
observed under conditions that result in highly dimin- 
ished cytotoxicity. 

Complex formation 

[01 79] dsRNA is either produced by in vitro transcrip- 
tion using the T7 promoter and polymerase or another 
RNA polymerase, such as an E. colt RNA polymerase. 
dsRNA can also be produced In an organism or cell us- 
ing endogenous polymerases. 

[0180] Concentrations of dsRNA, such as PSA-spe- 
cific dsRNA, are varied from 1 pg to 10 ng. In some in- 
stances, 150 ng of a plasmid that encodes a reporter of 
interest (PSA) to be silenced may be comixed at a con- 
centration between 10 ng and 10 ng. The concentration 
of cationic lipid, cationic surfactant, local anesthetic, or 
any other transfection facilitating agent that interacts 
with the nucleic acid electrostatically are varied at each 
of the dsRNA concentrations to yield charge ratios of 
0.1 to 1 000 (positive / negative) (i.e., the ratio of positive 
charge from lipids or other delivery agents to negative 
charge from DNA or RNA). The complexes are prepared 



in water or in buffer (e.g., phosphate, HEPES, citrate, 
Tris-HCI, Tris-glycine, malate, etc. at pH values that 
range from 4.0 to 8.5), may contain salt (e.g., 1 - 250 
mM), and may contain glycerol, sucrose, trehalose, xy- 

5 lose, or other sugars (e.g., mono-, di-, or polysaccha- 
ride). The mixture is allowed to sit at room temperature, 
desirably for 30 minutes, and may be stored indefinitely. 
The complexes are premixed in serum free media. The 
nucleic acid and the transfecting reagent may be mixed 

to either through direct addition or through a slow mixing 
process, such as across a dialyzing membrane or 
through the use of a microporous particle or a device 
that brings the two solutions together at a slow rate and 
at low concentrations. In some instances, the two inter- 

15 acting components are mixed at low concentrations, 
and the final complex is concentrated using a diafiftera- 
tion or any other concentrating device. Alternatively, if 
the complexes are formed at high concentrations of ei- 
ther or both of the interacting components, the complex- 
20 es may be diluted to form an ideal transfection mixture. 

Transfection protocol and analysis of dsRNA stress 
response 

25 [0181] Complexes are added to cells that are -60 
-80% confluent in serum free media. The complexes are 
incubated for various times (e.g., 10 minutes to 24 
hours) with the cells at 37°C and diluted with serum con- 
taining media or washed and replated in serum free me- 

^o dia. The cells are monitored for toxicity and analyzed at 
various times for signs of dsRNA response (e.g., TUN- 
NEL assay to detect nicked DNA, phosphorylation of 
EIF2alpah, induction and activation of 2'5' OAS, or in- 
terferon-alpha and -beta). Transfection conditions that 

35 result in less than 50% : 25%, 1 0%, or 1 % cytotoxicity or 
that result in a less than 20, 1 0, 5, 2, or 1 .5-fold induction 
of a stress response are analyzed to determine if PTGS 
was efficiently induced. 



40 Determining induction of PTGS 

[0182] PSA protein levels are determined in cell cul- 
ture media using standard methods. The data is normal- 
ized to the number of live cells in culture to determine 
45 the concentrations required to Induce PTGS. 

Results 

[0183] Using the above method, cationic lipid com- 
50 piexes of dsRNA induced toxicity at certain ranges. With 
lipofectamine as the cationic lipid, positive to negative 
charge ratios greater than 10 did not produce any de- 
tectable toxicity at any of the concentrations of dsRNA 
tested and induced a high level of PTGS, resulting in 
55 highly decreased levels of PSA in the culture medium. 
The RNA concentration ranges tested were 1 pg to 1 00 
ng with a constant amount of lipofectamine (10 uL of a 
2 mgfmL solution from GIBCO-BRL Life Technologies, 
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Bethesda, MD). 

Example 13: Method to avoid dsRNA mediated 
activation of the RNA stress response pathway 

5 

[0184] One or more components of the RNA stress 
response pathway can be mutated or inactivated to 
avoid induction/activation of the component(s) by dsR- 
NA that is delivered to the cell or animal for the purpose 
of inducing PTGS. These components, such as those io 
illustrated in Fig. 4, can be knocked out singularly or in 
combination. 

[0185] Various standard methods can be used to 
knockout components of the RNA stress response path- 
way, such as PKR, human beta interferon Accession No. * 5 
M25460), and/or 2 , 5 , OAS (Accession No. NIVL003733). 
Alternatively or additionally, one or more interferon re- 
sponse element (IRE) sequences can be mutated or de- 
leted using a knockout construct designed based on the 
IRE consensus sequence (Ghislain, et a/., J Interferon 20 
Cytokine Res. 2001 Jun 21(6):379-88.), and/or one or 
more transcription factors that bind IRE sequences, 
such as STAT1 (Accession number XM_010893), can 
be mutated or deleted. These methods include the use 
of antisense DNA/RNA, ribozymes, or targeted gene 25 
knockout technology mediated by homologous recom- 
bination. One skilled in the art is able to design the ap- 
propriate antisense sequences, ribozymes, and vectors 
for targeted knockouts. For example, targeted knock- 
outs may be prepared by any of the following standard 30 
methods: Shibata era/., Proc Natl Acad Sci USA. 2001 
Jul 17;98(15):8425-32. Review., Muyrers et aL, Trends 
Biochem ScL 2001 May;26(5):325-31., Paul etal., Mutat 
Res. 2001 Jun 5;486(1 ):1 1 -9., Shcherbakova et a/., Mu- 
tatRes.2000 Feb16;459(1):65-71.,Lantsov. Idealgene 35 
therapy: approaches and prospects Mol Biol (Mosk). 
1 994 May-Jun;28(3):485-95., in Gene Transfer and Ex- 
pression - A Laboratory Manual editor: Michael Kriegler, 
Publisher- WH Freeman & Co, New York, NY, pages 56 
-60,1990). 40 
[0186] Knockout cells can be readily identified either 
through the use of an antibiotic resistance marker which 
when transferred to the chromosome confers resistance 
to the cell or through the use of dsRNA itself. In partic- 
ular, dsRNA (e.g., a high concentration of dsRNA) In- *5 
duces apoptosis in wild-type cells while mutant cells sur- 
vive dsRNA treatment because they cannot mount a 
stress response. Yet another approach involves per- 
forming the dsRNA-induced PTGS experiment in the 
presence of large concentrations of IRE (dsDNA) oligo, so 
which is expected to titrate activated STAT proteins. 
These oligos can be delivered intracerllularly using 
transfecting agents or electroporation. 
[01 87] In another method of preventing the interferon 
response, cells (e.g., RD cells) are transf ected with a T7 55 
RNA polymerase expression vector and a T7 dsRNA ex- 
pression vector encoding dsRNA homologous to the hu- 
man protein kinase PKR cDNA (accession number 



M35663) or homologous to the coding sequence of any 
other component in the RNA stress response pathway. 
In one particular example, dsRNA corresponding to nu- 
cleotides 190 -2000 is encoded by the T7dsRNA ex- 
pression vector. The expression vectors are similar to 
those described in Example 10 and shown in Fig. 2, ex- 
cept that the dsRNA encoding sequence is derived from 
the human protein kinase PKR cDNA. Transfection in 
RD cells is performed as described in Example 1 0. With- 
in 2-5 days post-transfection, the cells are functionally 
PKR negative. 

Other Embodiments 

[0188] From the foregoing description, it will be appar- 
ent that variations and modifications may be made to 
the invention described herein to adopt It to various us- 
ages and conditions. Such embodiments are also within 
the scope of the following claims. 
[0189] AM publication, patents, and patent applica- 
tions mentioned in this specification are herein incorpo- 
rated by reference to the same extent as if each individ- 
ual publication, patent, or patent application was specif- 
ically and individually indicated to be incorporated by 
reference. 



Claims 

1. A method for identifying a nucleic acid that modu- 
lates the function of a vertebrate cell, said method 
comprising the steps of: 

(a) transforming a population of vertebrate cells 
with a double stranded RNA expression library, 
wherein at least two cells of said population of 
cells are each transformed with a different nu- 
cleic acid from said double stranded RNA ex- 
pression library, wherein said transformed nu- 
cleic acid is capable of forming double stranded 
RNA, and wherein said transformation and said 
formation of double stranded RNA are carried 
out under conditions that inhibit or prevent an 
interferon response or a double stranded RNA 
stress response; 

(b) selecting a vertebrate cell in which said nu- 
cleic acid is expressed in said cell; and 

(c) assaying for a modulation in the function of 
said cell, wherein said modulation identifies a 
nucleic acid that modulates the function of said 
vertebrate cell. 

2. A method for identifying a nucleic acid that modu- 
lates expression of a target nucleic acid in a verte- 
brate cell, said method comprising the steps of: 
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(a) transforming a population of vertebrate cells 
with a double stranded RNA expression library, 
wherein at least two cells of said population of 
cells are each transformed with a different nu- 
cleic acid from said double stranded RNA ex- 5 
pression library, wherein said transformed nu- 
cleic acid is capable of forming double stranded 
RNA, and wherein said transformation and said 
formation of double stranded RNA are carried 
out under conditions that inhibit or prevent an 10 
interferon response or a double stranded RNA 
stress response; 

(b) selecting a vertebrate cell in which said nu- 
cleic acid is expressed in said cell; and is 

(c) assaying for a modulation in the expression 
of a target nucleic acid in said cell, wherein said 
modulation identifies a nucleic acid that modu- 
lates expression of said target nucleic acid. 20 

3. A method for identifying a nucleic acid that modu- 
lates the biological activity of a target polypeptide 
in a vertebrate cell, said method comprising the 
steps of: 25 

(a) transforming a population of vertebrate cells 
with a double stranded RNA expression library, 
wherein at least two cells of said population of 
cells are each transformed with a different nu- 30 
cleic acid from said double stranded RNA ex- 
pression library, wherein said transformed nu- 
cleic acid is capable of forming double stranded 
RNA, and wherein said transformation and said 
formation of double stranded RNA are carried 35 
out under conditions that inhibit or prevent an 
interferon response or a double stranded RNA 
stress response; 

(b) selecting for a vertebrate cell in which said *o 
nucleic acid is expressed in said cell; and 

(c) assaying for a modulation in the biological 
activity of a target polypeptide in said cell, 
wherein said modulation identifies a nucleic ac- 45 
Id that modulates the biological activity of said 
target polypeptide. 

4. The method of claim 1 , wherein at most one nucleic 
acid is stably integrated into a chromosome of each so 
cell. 

5. The method of claim 1 , said method further com- 
prising: 

55 

(d) identifying said nucleic acid by amplifying 
said nucleic acid and sequencing said amplified 
nucleic acid. 



6. The method of claim 1 , wherein said double strand- 
ed RNA expression library comprises cDNAs de- 
rived from said cells. 

7. The method of claim 1 , wherein said double strand- 
ed RNA expression library comprises randomized 
nucleic acids. 

8. The method of claim 1 , wherein said double strand- 
ed RNA expression library is a nuclear double 
stranded RNA expression library. 

9. The method of claim 1 , wherein said double strand- 
ed RNA expression library is a cytoplasmic double 
stranded RNA expression library. 

10. The method of claim 1 , wherein said cell is a mam- 
malian cell. 

11. The method of claim 10, wherein said cell is a hu- 
man cell. 

12. The method of claim 1 0, wherein said cell is select- 
ed from the group consisting of a cancer cell, a cell 
of the immune system, a stem cell, a neuronal cell, 
a muscle cell, and an adipocyte. 

13. The method of claim 1 , wherein said nucleic acid is 
contained in a vector. 

14. The method of claim 13, wherein said vector com- 
prises an RNA polymerase II promoter, an RNA 
polymerase I promoter, an RNA polymerase ill pro- 
moter, or a mitochondrial promoter. 

15. The method of claim 13, wherein the sense strand 
and the anti-sense strand of said double stranded 
RNA are transcribed from the same nucleic acid us- 
ing two convergent promoters. 

16. The method of claim 13, wherein said nucleic acid 
comprises an inverted repeat such that upon tran- 
scription said nucleic acid forms a double stranded 
RNA. 

1 7. The method of claim 1 , wherein said assaying com- 
prises measuring an event selected from the group 
consisting of cell motility, apoptosis, cell growth, cell 
invasion, vascularization, cell cycle events, cell dif- 
ferentiation, cell dedifferentiation, neuronal cell re- 
generation, and the ability of a cell to support viral 
replication. 

1 8. The method of claim 1 , wherein said double strand- 
ed RNA is between 5 and 100 nucleotides in length, 
inclusive. 

19. The method of claim 1 , wherein said double strand- 
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ed RNA is at least 100 nucleotides in length. 

20. The method of claim 19, wherein said double 
stranded RNA is at least 250 nucleotides in length: 

21. The method of claim 20, wherein said double 
stranded RNA is at least 500 nucleotides in length. 

22. The method of claim 21, wherein said double 
stranded RNA is at least 1000 nucleotides in length. 

23. A method for identifying a nucleic acid that modu- 
lates the function of a vertebrate cell, said method 
comprising the steps of: 

(a) transforming a population of vertebrate cells 
with a double stranded RNA under conditions 
that inhibit or prevent an interferon response or 
a double stranded RNA stress response; 

(b) selecting a vertebrate cell in which said dou- 
ble stranded RNA is expressed; and 

(c) assaying for a modulation in the function of 
said cell, wherein said modulation identifies a 
nucleic acid that modulates the function of said 
cell. 

24. A method for identifying a nucleic acid that modu- 
lates expression of a target nucleic acid in a verte- 
brate cell, said method comprising the steps of: 

(a) transforming a population of vertebrate cells 
with a double stranded RNA under conditions 
that inhibit or prevent an interferon response or 
a double stranded RNA stress response; 

(b) selecting a vertebrate cell in which said dou- 
ble stranded RNA is expressed; and 

(c) assaying for a modulation in the expression 
of a target nucleic acid in said cell, wherein said 
modulation identifies a nucleic acid that modu- 
lates expression of said target nucleic acid. 

25. A method for identifying a nucleic acid that modu- 
lates the biological activity of a target polypeptide 
in a vertebrate cell, said method comprising the 
steps of: 

(a) transforming a population of vertebrate cells 
with a double stranded RNA under conditions 
that inhibit or prevent an interferon response or 
a double stranded RNA stress response; 

(b) selecting a vertebrate cell in which said dou- 
ble stranded RNA is expressed; and 



(c) assaying for a modulation in the biological 
activity of a target polypeptide in said cell, 
wherein said modulation identifies a nucleic ac- 
id that modulates the biological activity of said 
5 target polypeptide. 

26. The method of claim 23, said method further com- 
prising: 

™ (d) identifying said nucleic acid by amplifying 

said nucleic acid and sequencing said amplified 
nucleic acid. 

27. The method of claim 23, wherein said double 
*5 stranded RNA is derived from said cells. 

28. The method of claim 23, wherein said double 
stranded RNA is between 5 and 100 nucleotides in 
length, inclusive. 

20 

29. The method of claim 23, wherein said double 
stranded RNA is at least 100 nucleotides in length. 

30. The method of claim 29, wherein said double 
25 stranded RNA is at least 250 nucleotides in length. 

31. The method of claim 30, wherein said double 
stranded RNA is at least 500 nucleotides in length. 

30 32. The method of claim 31, wherein said double 
stranded RNA is at least 1 000 nucleotides in length. 

33. The method of claim 23, wherein each cell is trans- 
formed with at most one double stranded RNA. 

35 

34. The method of claim 23, wherein said cell is a mam- 
malian cell. 

35. The method of claim 34, wherein said cell is a hu- 
40 man cell. 

36. The method of claim 23, wherein said population is 
transformed with at least 100 different double 
stranded RNAs. 

45 

37. The method of claim 1 or 23, said method further 
comprising: 

(d) identifying said nucleic acid that modulates 
so the function of said cell. 

38. The method of claim 2 or 24, said method further 
comprising: 

55 (d) identifying said nucleic acid that modulates 

expression of said target nucleic acid. 

39. The method of claim 3 or 25, said method further 
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comprising: 

(d) identifying said nucleic acid that modulates 
the biological activity of said target polypeptide. 
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Figure 4 
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